HouYi: An open-source large language model specially designed for renewable energy and carbon neutrality field

07/31/2023
by   Mingliang Bai, et al.
0

Renewable energy is important for achieving carbon neutrality goal. With the great success of Large Language Models (LLMs) like ChatGPT in automatic content generation, LLMs are playing an increasingly important role. However, there has not been a specially designed LLM for renewable energy. Meanwhile, there has not been any dataset of renewable energy for training LLMs. Therefore, this paper published the first open-source Renewable Energy Academic Paper (REAP) dataset for non-commercial LLM research of renewable energy. REAP dataset is collected through searching the title and abstract of 1,168,970 academic literatures from Web of Science. Based on REAP dataset, HouYi model, the first LLM for renewable energy, is developed through finetuning general LLMs. HouYi demonstrated powerful academic paper paragraph generation ability in renewable energy field. Experiments show that its ability to generate academic papers on renewable energy is comparable to ChatGPT, slightly outperforms Claude, ERNIE Bot and SparkDesk, and significantly outperforms open-source LLaMA-13B model.

READ FULL TEXT
research
03/10/2023

Algorithmic Ghost in the Research Shell: Large Language Models and Academic Knowledge Creation in Management Research

The paper looks at the role of large language models in academic knowled...
research
12/02/2019

Neural Academic Paper Generation

In this work, we tackle the problem of structured text generation, speci...
research
09/21/2023

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Large language models (LLMs) have pushed the limits of natural language ...
research
04/24/2023

CHEAT: A Large-scale Dataset for Detecting ChatGPT-writtEn AbsTracts

The powerful ability of ChatGPT has caused widespread concern in the aca...
research
07/04/2023

Can We Mathematically Spot Possible Manipulation of Results in Research Manuscripts Using Benford's Law?

The reproducibility of academic research has long been a persistent issu...
research
12/08/2022

DECO2 An Open-source Energy System Decarbonisation Planning Software Including Negative Emissions Technologies

The deployment of CO2 capture and storage (CCS) and negative emissions t...
research
08/22/2023

Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

Large Language Models (LLMs) have revolutionized Natural Language Proces...

Please sign up or login with your details

Forgot password? Click here to reset