GLM for partially pooled categorical predictors with a case study in biosecurity

11/25/2022
by   Christopher M. Baker, et al.
0

National governments use border information to efficiently manage the biosecurity risk presented by travel and commerce. In the Australian border biosecurity system, data about cargo consignments are collected from records of directions: that is, the records of actions taken by the biosecurity regulator. This data collection is complicated by the way directions for a given entry are recorded. An entry is a collection of import lines where each line is a single type of item or commodity. Analysis is simple when the data are recorded in line mode: the directions are recorded individually for each line. The challenge comes when data are recorded in container mode, because the same direction is recorded against each line in the entry. In other words, if at least one line in an entry has a non-compliant inspection result, then all lines in that entry are recorded as non-compliant. Therefore, container mode data creates a challenge for estimating the probability that certain items are non-compliant, because matching the records of non-compliance to the line information is impossible. We develop a statistical model to use container mode data to help inform biosecurity risk of items. We use asymptotic analysis to estimate the value of container mode data compared to line mode data, do a simulation study to verify that we can accurately estimate parameters in a large dataset, and we apply our methods to a real dataset, for which important information about the risk of non-compliance is recovered using the new model.

READ FULL TEXT
research
10/10/2021

Re-entry prediction and demisability analysis for the atmospheric disposal of geosynchronous satellites

The paper presents a re-entry analysis of Geosynchronous Orbit (GSO) sat...
research
05/21/2021

WildKey: A Privacy-Aware Keyboard Toolkit for Data Collection In-The-Wild

Touch data, and in particular text-entry data, has been mostly collected...
research
06/14/2023

A Unified Probabilistic Framework for Spatiotemporal Passenger Crowdedness Inference within Urban Rail Transit Network

This paper proposes the Spatio-Temporal Crowdedness Inference Model (STC...
research
11/23/2019

Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot

We propose a simple all-in-line single-shot scheme for diagnostics of ul...
research
12/04/2022

The flexible Gumbel distribution: A new model for inference about the mode

A new unimodal distribution family indexed by the mode and three other p...
research
09/26/2019

New Attacks and Defenses for Randomized Caches

The last level cache is vulnerable to timing based side channel attacks ...
research
09/13/2023

Adaptive sampling method to monitor low-risk pathways with limited surveillance resources

The rise of globalisation has led to a sharp increase in international t...

Please sign up or login with your details

Forgot password? Click here to reset