CFGPT: Chinese Financial Assistant with Large Language Model

09/19/2023
by   Jiangtong Li, et al.
0

Large language models (LLMs) have demonstrated great potential in natural language processing tasks within the financial domain. In this work, we present a Chinese Financial Generative Pre-trained Transformer framework, named CFGPT, which includes a dataset (CFData) for pre-training and supervised fine-tuning, a financial LLM (CFLLM) to adeptly manage financial texts, and a deployment framework (CFAPP) designed to navigate real-world financial applications. The CFData comprising both a pre-training dataset and a supervised fine-tuning dataset, where the pre-training dataset collates Chinese financial data and analytics, alongside a smaller subset of general-purpose text with 584M documents and 141B tokens in total, and the supervised fine-tuning dataset is tailored for six distinct financial tasks, embodying various facets of financial analysis and decision-making with 1.5M instruction pairs and 1.5B tokens in total. The CFLLM, which is based on InternLM-7B to balance the model capability and size, is trained on CFData in two stage, continued pre-training and supervised fine-tuning. The CFAPP is centered on large language models (LLMs) and augmented with additional modules to ensure multifaceted functionality in real-world application. Our codes are released at https://github.com/TongjiFinLab/CFGPT.

READ FULL TEXT

page 4

page 7

research
02/18/2023

BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark

To advance Chinese financial natural language processing (NLP), we intro...
research
04/17/2023

Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca

Large Language Models (LLMs), such as ChatGPT and GPT-4, have revolution...
research
05/19/2023

XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters

In recent years, pre-trained language models have undergone rapid develo...
research
08/31/2023

YaRN: Efficient Context Window Extension of Large Language Models

Rotary Position Embeddings (RoPE) have been shown to effectively encode ...
research
07/31/2023

FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis

In this paper, we propose FinVis-GPT, a novel multimodal large language ...
research
04/25/2022

Super-Prompting: Utilizing Model-Independent Contextual Data to Reduce Data Annotation Required in Visual Commonsense Tasks

Pre-trained language models have shown excellent results in few-shot lea...
research
10/25/2022

Learning Better Intent Representations for Financial Open Intent Classification

With the recent surge of NLP technologies in the financial domain, banks...

Please sign up or login with your details

Forgot password? Click here to reset