Introduction

DCASE-models is an open-source Python library for rapid prototyping of environmental sound analysis systems, with an emphasis on deep–learning models. The project is on GitHub.

It’s main features / design goals are:

  • ease of use,
  • rapid prototyping of environmental sound analysis systems,
  • a simple and lightweight set of basic components that are generally part of a computational environmental audio analysis system,
  • a collection of functions for dataset handling, data preparation, feature extraction, and evaluation (most of which rely on existing tools),
  • a model interface to standardize the interaction of machine learning methods with the other system components,
  • an abstraction layer to make the library independent of the backend used to implement the machine learning model,
  • inclusion of reference implementations for several state-of-the-art algorithms.

DCASE-models is a work in progress, thus input is always welcome.

The available documentation is limited for now, but you can help to improve it.