Writing Across the World's Languages: Deep Internationalization for Gboard, the Google Keyboard

12/03/2019
by Daan van Esch, et al.

This technical report describes our deep internationalization program for Gboard, the Google Keyboard. Today, Gboard supports 900+ language varieties across 70+ writing systems, and we explain how and why we have been adding support for hundreds of language varieties from around the globe. Many of the world's languages are increasingly used in everyday writing, and we describe the trends we see. We cover the technological and logistical challenges of scaling a language technology product like Gboard to hundreds of language varieties, and describe the systems and processes we built to operate at scale. Finally, we summarize the key takeaways from user studies we ran with speakers of hundreds of languages from around the world.

