midas-research / audino
- воскресенье, 14 июня 2020 г. в 00:21:45
JavaScript
Open source audio annotation tool for humans™
audino is an open source audio annotation tool. It provides features such as transcription and labeling which enables annotation for Voice Activity Detection (VAD), Diarization, Speaker Identification, Automated Speech Recognition, Emotion Recognition tasks and more.
Current features of the tool include:
Note: Please see getting started guide for configurations and concrete usage.
Please install the following dependencies to run audino on your system:
$ git clone https://github.com/midas-research/audino.git
$ cd audinoYou can either run the project on default configuration or modify them to your need.
To build the services, run:
$ docker-compose -f docker-compose.prod.yml buildTo bring up the services, run:
$ docker-compose -f docker-compose.prod.yml upThen, in browser, go to http://0.0.0.0/ to view the application.
To bring down the services, run:
$ docker-compose -f docker-compose.prod.yml downSimilar to production setup, you need to use development configuration for working on the project, fixing bugs and making contributions.
To build the services, run:
$ docker-compose -f docker-compose.dev.yml buildTo bring up the services, run:
$ docker-compose -f docker-compose.dev.yml upThen, in browser, go to http://localhost:3000/ to view the application.
To bring down the services, run:
$ docker-compose -f docker-compose.dev.yml downWe provide a set of tutorials to guide users to achieve certain tasks. If you feel something is missing and should be included, please open an issue.
Currently, the paper is under review. For now, please cite it as:
@misc{grover2020audino,
title={audino: A Modern Annotation Tool for Audio and Speech},
author={Manraj Singh Grover and Pakhi Bamdev and Yaman Kumar and Mika Hama and Rajiv Ratn Shah},
year={2020},
eprint={2006.05236},
archivePrefix={arXiv},
primaryClass={cs.SD}
}
MIT © MIDAS, IIIT Delhi