Whose Text Is It Anyway? Exploring BigCode, Intellectual Property, and Ethics

04/06/2023
by   Madiha Zahrah Choksi, et al.
0

Intelligent or generative writing tools rely on large language models that recognize, summarize, translate, and predict content. This position paper probes the copyright interests of open data sets used to train large language models (LLMs). Our paper asks, how do LLMs trained on open data sets circumvent the copyright interests of the used data? We start by defining software copyright and tracing its history. We rely on GitHub Copilot as a modern case study challenging software copyright. Our conclusion outlines obstacles that generative writing assistants create for copyright, and offers a practical road map for copyright analysis for developers, software law experts, and general users to consider in the context of intelligent LLM-powered writing tools.

READ FULL TEXT

page 1

page 2

page 3

research
03/27/2023

LMCanvas: Object-Oriented Interaction to Personalize Large Language Model-Powered Writing Environments

Large language models (LLMs) can enhance writing by automating or suppor...
research
03/28/2023

Writing Assistants Should Model Social Factors of Language

Intelligent writing assistants powered by large language models (LLMs) a...
research
04/06/2023

Approach Intelligent Writing Assistants Usability with Seven Stages of Action

Despite the potential of Large Language Models (LLMs) as writing assista...
research
09/14/2023

ChatGPT v Bard v Bing v Claude 2 v Aria v human-expert. How good are AI chatbots at scientific writing? (ver. 23Q3)

Historically, proficient writing was deemed essential for human advancem...
research
04/10/2022

Is GitHub's Copilot as Bad As Humans at Introducing Vulnerabilities in Code?

Several advances in deep learning have been successfully applied to the ...
research
07/10/2023

Can Large Language Models Write Good Property-Based Tests?

Property-based testing (PBT), while an established technique in the soft...
research
04/19/2023

Towards Objective-Tailored Genetic Improvement Through Large Language Models

While Genetic Improvement (GI) is a useful paradigm to improve functiona...

Please sign up or login with your details

Forgot password? Click here to reset