Automatic Analysis of Available Source Code of Top Artificial Intelligence Conference Papers

09/28/2022
by   Jialiang Lin, et al.
0

Source code is essential for researchers to reproduce the methods and replicate the results of artificial intelligence (AI) papers. Some organizations and researchers manually collect AI papers with available source code to contribute to the AI community. However, manual collection is a labor-intensive and time-consuming task. To address this issue, we propose a method to automatically identify papers with available source code and extract their source code repository URLs. With this method, we find that 20.5 regular papers of 10 top AI conferences published from 2010 to 2019 are identified as papers with available source code and that 8.1 code repositories are no longer accessible. We also create the XMU NLP Lab README Dataset, the largest dataset of labeled README files for source code document research. Through this dataset, we have discovered that quite a few README files have no installation instructions or usage tutorials provided. Further, a large-scale comprehensive statistical analysis is made for a general picture of the source code of AI conference papers. The proposed solution can also go beyond AI conference papers to analyze other scientific papers from both journals and conferences to shed light on more domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/16/2019

An Explorative Study of GitHub Repositories of AI Papers

With the rapid development of AI technologies, thousands of AI papers ar...
research
04/20/2023

A Study on Reproducibility and Replicability of Table Structure Recognition Methods

Concerns about reproducibility in artificial intelligence (AI) have emer...
research
11/09/2017

DLPaper2Code: Auto-generation of Code from Deep Learning Research Papers

With an abundance of research papers in deep learning, reproducibility o...
research
12/29/2021

Working mechanism of Eternalblue and its application in ransomworm

After the leaking of exploit Eternalblue, some ransomworms utilizing thi...
research
11/04/2022

The Sustainable Development Goals and Aerospace Engineering: A critical note through Artificial Intelligence

The 2030 Agenda of the United Nations (UN) revolves around the Sustainab...
research
06/15/2023

Are ChatGPT and Other Similar Systems the Modern Lernaean Hydras of AI?

The rise of Generative Artificial Intelligence systems (“AI systems”) ha...
research
05/26/2021

Benchmarking Scientific Image Forgery Detectors

The scientific image integrity area presents a challenging research bott...

Please sign up or login with your details

Forgot password? Click here to reset