Classifying Patents Based on their Semantic Content

12/27/2016
by   Antonin Bergeaud, et al.
0

In this paper, we extend some usual techniques of classification resulting from a large-scale data-mining and network approach. This new technology, which in particular is designed to be suitable to big data, is used to construct an open consolidated database from raw data on 4 million patents taken from the US patent office from 1976 onward. To build the pattern network, not only do we look at each patent title, but we also examine their full abstract and extract the relevant keywords accordingly. We refer to this classification as semantic approach in contrast with the more common technological approach which consists in taking the topology when considering US Patent office technological classes. Moreover, we document that both approaches have highly different topological measures and strong statistical evidence that they feature a different model. This suggests that our method is a useful tool to extract endogenous information.

READ FULL TEXT

page 10

page 35

page 36

page 37

page 38

page 39

page 40

research
07/31/2023

A new mapping of technological interdependence

Which technological linkages affect the sector's ability to innovate? Ho...
research
01/03/2018

The Unified Astronomy Thesaurus: Semantic Metadata for Astronomy and Astrophysics

Several different controlled vocabularies have been developed and used b...
research
06/08/2021

Defining definition: a Text mining Approach to Define Innovative Technological Fields

One of the first task of an innovative project is delineating the scope ...
research
12/03/2017

Exploration of an Interdisciplinary Scientific Landscape

Patterns of interdisciplinarity in science can be quantified through div...
research
11/05/2018

Data Integration for Supporting Biomedical Knowledge Graph Creation at Large-Scale

In recent years, following FAIR and open data principles, the number of ...
research
12/03/2016

Mining Spatio-temporal Data on Industrialization from Historical Registries

Despite the growing availability of big data in many fields, historical ...
research
08/31/2019

Mapping Firms' Locations in Technological Space: A Topological Analysis of Patent Statistics

Where do firms innovate? Mapping their locations in technological space ...

Please sign up or login with your details

Forgot password? Click here to reset