FlagIt: A System for Minimally Supervised Human Trafficking Indicator Mining

12/05/2017
by   Mayank Kejriwal, et al.
0

In this paper, we describe and study the indicator mining problem in the online sex advertising domain. We present an in-development system, FlagIt (Flexible and adaptive generation of Indicators from text), which combines the benefits of both a lightweight expert system and classical semi-supervision (heuristic re-labeling) with recently released state-of-the-art unsupervised text embeddings to tag millions of sentences with indicators that are highly correlated with human trafficking. The FlagIt technology stack is open source. On preliminary evaluations involving five indicators, FlagIt illustrates promising performance compared to several alternatives. The system is being actively developed, refined and integrated into a domain-specific search system used by over 200 law enforcement agencies to combat human trafficking, and is being aggressively extended to mine at least six more indicators with minimal programming effort. FlagIt is a good example of a system that operates in limited label settings, and that requires creative combinations of established machine learning techniques to produce outputs that could be used by real-world non-technical analysts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/04/2023

Towards a Flexible User Interface for 'Quick and Dirty' Learning Analytics Indicator Design

Research on Human-Centered Learning Analytics (HCLA) has provided demons...
research
09/21/2023

A review of troubled cell indicators for discontinuous Galerkin method

In this paper, eight different troubled cell indicators (shock detectors...
research
12/28/2020

Phishing Detection through Email Embeddings

The problem of detecting phishing emails through machine learning techni...
research
11/24/2021

Mining Meta-indicators of University Ranking: A Machine Learning Approach Based on SHAP

University evaluation and ranking is an extremely complex activity. Majo...
research
05/01/2020

The Hypervolume Indicator: Problems and Algorithms

The hypervolume indicator is one of the most used set-quality indicators...
research
06/03/2019

Mining Data from the Congressional Record

We propose a data storage and analysis method for using the US Congressi...

Please sign up or login with your details

Forgot password? Click here to reset