Text mining policy: Classifying forest and landscape restoration policy agenda with neural information retrieval

08/07/2019
by   John Brandt, et al.
0

Dozens of countries have committed to restoring the ecological functionality of 350 million hectares of land by 2030. In order to achieve such wide-scale implementation of restoration, the values and priorities of multi-sectoral stakeholders must be aligned and integrated with national level commitments and other development agenda. Although misalignment across scales of policy and between stakeholders are well known barriers to implementing restoration, fast-paced policy making in multi-stakeholder environments complicates the monitoring and analysis of governance and policy. In this work, we assess the potential of machine learning to identify restoration policy agenda across diverse policy documents. An unsupervised neural information retrieval architecture is introduced that leverages transfer learning and word embeddings to create high-dimensional representations of paragraphs. Policy agenda labels are recast as information retrieval queries in order to classify policies with a cosine similarity threshold between paragraphs and query embeddings. This approach achieves a 0.83 F1-score measured across 14 policy agenda in 31 policy documents in Malawi, Kenya, and Rwanda, indicating that automated text mining can provide reliable, generalizable, and efficient analyses of restoration policy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2021

A data-driven strategy to combine word embeddings in information retrieval

Word embeddings are vital descriptors of words in unigram representation...
research
11/10/2021

Cross-language Information Retrieval

Two key assumptions shape the usual view of ranked retrieval: (1) that t...
research
01/08/2022

Beyond modeling: NLP Pipeline for efficient environmental policy analysis

As we enter the UN Decade on Ecosystem Restoration, creating effective i...
research
07/13/2020

A Feature Analysis for Multimodal News Retrieval

Content-based information retrieval is based on the information containe...
research
06/05/2020

Balancing Reinforcement Learning Training Experiences in Interactive Information Retrieval

Interactive Information Retrieval (IIR) and Reinforcement Learning (RL) ...
research
10/05/2022

IRJIT – An Information Retrieval Technique for Just-in-time Defect Identification

Defect identification at commit check-in time prevents the introduction ...
research
01/13/2020

On the Replicability of Combining Word Embeddings and Retrieval Models

We replicate recent experiments attempting to demonstrate an attractive ...

Please sign up or login with your details

Forgot password? Click here to reset