JIANG: Chinese Open Foundation Language Model

08/01/2023
by   Qinhua Duan, et al.
0

With the advancements in large language model technology, it has showcased capabilities that come close to those of human beings across various tasks. This achievement has garnered significant interest from companies and scientific research institutions, leading to substantial investments in the research and development of these models. While numerous large models have emerged during this period, the majority of them have been trained primarily on English data. Although they exhibit decent performance in other languages, such as Chinese, their potential remains limited due to factors like vocabulary design and training corpus. Consequently, their ability to fully express their capabilities in Chinese falls short. To address this issue, we introduce the model named JIANG (Chinese pinyin of ginger) specifically designed for the Chinese language. We have gathered a substantial amount of Chinese corpus to train the model and have also optimized its structure. The extensive experimental results demonstrate the excellent performance of our model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/21/2022

StyleBERT: Chinese pretraining by font style information

With the success of down streaming task using English pre-trained langua...
research
03/03/2020

CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model

In this paper, we introduce the Chinese corpus from CLUE organization, C...
research
03/03/2020

CLUECorpus2020: A Large-scale Chinese Corpus for Pre-trainingLanguage Model

In this paper, we introduce the Chinese corpus from CLUE organization, C...
research
01/13/2023

In BLOOM: Creativity and Affinity in Artificial Lyrics and Art

We apply a large multilingual language model (BLOOM-176B) in open-ended ...
research
07/27/2023

SuperCLUE: A Comprehensive Chinese Large Language Model Benchmark

Large language models (LLMs) have shown the potential to be integrated i...
research
04/16/2021

A Masked Segmental Language Model for Unsupervised Natural Language Segmentation

Segmentation remains an important preprocessing step both in languages w...
research
02/16/2023

Reanalyzing L2 Preposition Learning with Bayesian Mixed Effects and a Large Language Model

We use both Bayesian and neural models to dissect a data set of Chinese ...

Please sign up or login with your details

Forgot password? Click here to reset