Arukikata Travelogue Dataset

05/19/2023
by   Hiroki Ouchi, et al.
0

We have constructed Arukikata Travelogue Dataset and released it free of charge for academic research. This dataset is a Japanese text dataset with a total of over 31 million words, comprising 4,672 Japanese domestic travelogues and 9,607 overseas travelogues. Before providing our dataset, there was a scarcity of widely available travelogue data for research purposes, and each researcher had to prepare their own data. This hinders the replication of existing studies and fair comparative analysis of experimental results. Our dataset enables any researchers to conduct investigation on the same data and to ensure transparency and reproducibility in research. In this paper, we describe the academic significance, characteristics, and prospects of our dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2022

Cem Mil Podcasts: A Spoken Portuguese Document Corpus

This document describes the Portuguese language podcast dataset released...
research
03/09/2016

Lexical bundles in computational linguistics academic literature

In this study we analyzed a corpus of 8 million words academic literatur...
research
07/06/2020

Incorrect Data in the Widely Used Inside Airbnb Dataset

Several recently published papers in Decision Support Systems discussed ...
research
03/23/2021

A large-scale study on research code quality and execution

This article presents a study on the quality and execution of research c...
research
08/12/2020

The network footprint of replication in popular DBMSs

Database replication is an important component of reliable, disaster tol...
research
01/31/2021

Predicting replicability – analysis of survey and prediction market data from large-scale forecasting projects

The reproducibility of published research has become an important topic ...
research
07/04/2023

Can We Mathematically Spot Possible Manipulation of Results in Research Manuscripts Using Benford's Law?

The reproducibility of academic research has long been a persistent issu...

Please sign up or login with your details

Forgot password? Click here to reset