Advances in Large Language Models (LLMs) have inspired a surge of resear...
We present a dataset generator engine named Web-based Visual Corpus Buil...
Semi-structured query systems for document-oriented databases have many ...
Understanding document images (e.g., invoices) has been an important res...
It is well-known that typical word embedding methods such as Word2Vec an...
A real-world information extraction (IE) system for semi-structured docu...
Multimodal relational data analysis has become of increasing importance ...
Many new proposals for scene text recognition (STR) models have been
int...
We propose weighted inner product similarity (WIPS) for
neural-network b...
We propose shifted inner-product similarity (SIPS), which is a novel yet...
Applying conventional word embedding models to unsegmented languages, wh...