Integration of Japanese Papers Into the DBLP Data Set

Anyone looking for a particular computer science publication is likely to search the DBLP. The DBLP data set is continuously extended with new publications, or rather with their metadata, such as the names of the authors, the title, and the publication date. Although the data set is already large, its coverage of specific areas can still be improved. The DBLP offers a huge collection of English papers because most computer science papers are published in English. Nevertheless, there are official publications in other languages that should also be added to the data set, among them Japanese papers. This diploma thesis presents a way to automatically process publication lists of Japanese papers and prepare them for import into the DBLP data set. It pays particular attention to the problems that arise during processing, such as transcription handling and Personal Name Matching with Japanese names.
