OtoWorld: Towards Learning to Separate by Learning to Move

07/12/2020
by   Omkar Ranadive, et al.
0

We present OtoWorld, an interactive environment in which agents must learn to listen in order to solve navigational tasks. The purpose of OtoWorld is to facilitate reinforcement learning research in computer audition, where agents must learn to listen to the world around them to navigate. OtoWorld is built on three open source libraries: OpenAI Gym for environment and agent interaction, PyRoomAcoustics for ray-tracing and acoustics simulation, and nussl for training deep computer audition models. OtoWorld is the audio analogue of GridWorld, a simple navigation game. OtoWorld can be easily extended to more complex environments and games. To solve one episode of OtoWorld, an agent must move towards each sounding source in the auditory scene and "turn it off". The agent receives no other input than the current sound of the room. The sources are placed randomly within the room and can vary in number. The agent receives a reward for turning off a source. We present preliminary results on the ability of agents to win at OtoWorld. OtoWorld is open-source and available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2021

A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments

In this work we apply deep reinforcement learning to the problems of nav...
research
06/17/2022

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Autonomous agents have made great strides in specialist domains like Ata...
research
11/13/2018

Open-source platforms for fast room acoustic simulations in complex structures

This article presents new numerical simulation tools, respectively devel...
research
11/29/2017

HoME: a Household Multimodal Environment

We introduce HoME: a Household Multimodal Environment for artificial age...
research
10/13/2021

Extending Environments To Measure Self-Reflection In Reinforcement Learning

We consider an extended notion of reinforcement learning in which the en...
research
06/21/2021

Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment Operations

Involving humans directly for the benefit of AI agents' training is gett...
research
05/10/2021

A Deep Reinforcement Learning Approach to Audio-Based Navigation in a Multi-Speaker Environment

In this work we use deep reinforcement learning to create an autonomous ...

Please sign up or login with your details

Forgot password? Click here to reset