DBpedia Dataset


wget https://data.deepai.org/dbpedia.zip
DBpedia is aimed towards extracting structured content from Wikipedia. This is a data extract (after preprocessing, with kernel included) with taxonomic, hierarchical categories, or classes, for ~343k Wikipedia articles. There are 3 levels with 9, 70, and 219 classes. A version of this dataset is also a popular baseline for NLP/text classification tasks.