Speech and Language Processing System for Eurobin Coopetition

This repository implements a ROS-based speech and language processing framework designed to support the Eurobin Coopetition. The system integrates speech-to-text, large language models (LLMs), and text-to-speech functionalities across two interconnected modules.

System Overview

1. ASR + LLM Module (`asr_llm`)

Purpose: Captures audio from a microphone, transcribes it to text, and generates a structured plan that satisfies the Eurobin Coopetition setup.
Key Outputs:
- Generated Plan: A structured plan derived using advanced language models and custom grammars.
  - Chain of Thought: A step-by-step reasoning process (part of the generated plan) published for downstream use.
Technology Stack:
- faster-whisper
- Llama-cpp-python

2. Speech Generation Module (`speech_gen`)

Purpose: Takes the Chain of Thought output from the asr_llm module and generates high-quality synthetic speech for human interaction.
Technology Stack:
- coqui-tts

Contact

For questions or support, please contact:

Dionis Totsila: [email protected]

Citing this work

If you use our code or part of it in your research, please cite our paper:

@misc{amadio2024vocalinstructionshouseholdtasks,
      title={From Vocal Instructions to Household Tasks: The Inria Tiago++ in the euROBIN Service Robots Coopetition}, 
      author={Fabio Amadio and Clemente Donoso and Dionis Totsila and Raphael Lorenzo and Quentin Rouxel and Olivier Rochel and Enrico Mingo Hoffman and Jean-Baptiste Mouret and Serena Ivaldi},
      year={2024},
      eprint={2412.17861},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2412.17861}, 
}

Acknowledgements

This research was supported by:

CPER CyberEntreprises
Creativ’Lab platform of Inria/LORIA
EU Horizon project euROBIN (GA n.101070596)
France 2030 program through the PEPR O2R projects AS3 and PI3 (ANR-22-EXOD-007, ANR-22-EXOD-004)

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
asr_llm		asr_llm
assets		assets
speech_gen		speech_gen
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech and Language Processing System for Eurobin Coopetition

System Overview

1. ASR + LLM Module (`asr_llm`)

2. Speech Generation Module (`speech_gen`)

Contact

Citing this work

Acknowledgements

About

Releases

Packages

Languages

hucebot/eurobin_llm_plan

Folders and files

Latest commit

History

Repository files navigation

Speech and Language Processing System for Eurobin Coopetition

System Overview

1. ASR + LLM Module (asr_llm)

2. Speech Generation Module (speech_gen)

Contact

Citing this work

Acknowledgements

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. ASR + LLM Module (`asr_llm`)

2. Speech Generation Module (`speech_gen`)

Packages