The Tour Guide Robot is a comprehensive robotics system designed for autonomous guided tours in various environments. Built on the R1 humanoid robot platform, this system combines advanced navigation, natural language processing, computer vision, and human-robot interaction capabilities to create engaging and intelligent touring experiences.
The system integrates the YARP (Yet Another Robot Platform) and ROS2 frameworks to provide a robust foundation for real-time robotics applications, featuring a modular architecture that supports both physical robot deployment and simulation environments.
- Navigation & SLAM: Advanced autonomous navigation with simultaneous localization and mapping
- Hardware Integration: Support for various sensors including RPLidar, cameras, and IMUs
- Real-time Control: Precise motor control and sensor fusion for smooth robot operation
- Natural Language Processing: Integration with Google Cloud Speech services and Azure OpenAI
- Wake Word Detection: Voice activation using Porcupine wake word technology
- Multi-language Support: Tour content and interactions in multiple languages
- Speech Synthesis: High-quality text-to-speech for engaging tour narration
- Facial Expression Control: Animated expressions with synchronized audio-visual feedback
- Eye Contact Management: Intelligent gaze control for natural human interaction
- Crowd Detection: Real-time human detection and tracking around Points of Interest
- Visual Feedback: Dynamic visual responses to enhance user engagement
- Gazebo Integration: Complete simulation environment with physics-based robot models
- Docker Containerization: Multiple specialized environments for different development needs
- Cross-platform Support: Consistent development experience across different systems
The system is built with a modular architecture consisting of:
- Core Applications (`app/`): Main robot control, navigation, and tour management applications
- Auxiliary Modules (`aux_modules/`): Specialized modules for human interaction, speech processing, and environmental sensing
- Docker Environments (`docker_stuff/`): Containerized development and deployment environments
- Communication Interfaces (`interfaces/`): YARP communication protocols defined by Thrift files
- Behavior Trees (`skills/`): Modular robot behaviors and decision-making logic
The fastest way to get started is using our pre-built Docker environments:
```bash
# Clone the repository
git clone https://github.com/hsp-iit/tour-guide-robot.git
cd tour-guide-robot
# Choose your environment (pick one):
cd docker_stuff/docker_tourCore # For complete robotics development
cd docker_stuff/docker_sim # For simulation and testing
cd docker_stuff/docker_talk # For conversational AI development
# Build and run
chmod +x manage-docker.sh
./manage-docker.sh --build --ubuntu --ros_distro jazzy --devel
./manage-docker.sh --ubuntu --ros_distro jazzy --devel
```

For native development, install the required dependencies:

```bash
# Install YARP and ROS2
conda create -n robotics-env
conda activate robotics-env
conda install -c conda-forge -c robotology yarp
# Follow ROS2 Jazzy installation instructions
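# Optional sanity check: confirm the YARP companion CLI is available
yarp version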
# Build the project
mkdir build && cd build
cmake .. -DOPENCV_ON=ON
make -j$(nproc)
```

We provide four specialized Docker environments for different use cases:
| Environment | Use Case | Key Features |
|---|---|---|
| docker_tourCore | Real robot development | Complete YARP+ROS2 stack, hardware interfaces |
| docker_sim | Simulation & testing | Gazebo integration, virtual environments |
| docker_talk | Conversational AI | Speech services, LLM integration, wake word |
| docker_tourCore_cuda_custom | GPU development | Custom CUDA versions, accelerated computing |
View Complete Docker Documentation
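For example, assuming each environment directory ships the same `manage-docker.sh` helper shown in the quick start, the simulation image can be built and started with:

```bash
cd docker_stuff/docker_sim
./manage-docker.sh --build --ubuntu --ros_distro jazzy --devel   # build the image
./manage-docker.sh --ubuntu --ros_distro jazzy --devel           # start the container
```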
The system includes specialized auxiliary modules for enhanced robot capabilities (a command-line interaction sketch follows the list):
- Tour Manager: Central orchestration for guided tours with Points of Interest management
- Head Synchronizer: Audio-visual coordination for speech and facial expressions
- Speech Processing: Complete voice interaction pipeline with wake word detection
- Eye Contact Manager: Intelligent gaze control during navigation and interaction
- Face Expression: Animated facial expressions with real-time audio synchronization
- Crowd Detector: Human detection and tracking around tour locations
- Head Obstacles Scanner: Environmental scanning with head movement control
- Google Logger: Interaction logging and analytics
View Complete Auxiliary Modules Documentation
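All of these modules expose their functionality through YARP ports. As a minimal command-line sketch (the module port names below are hypothetical; check each module's documentation for the actual names):

```bash
# Port names below are hypothetical; look up the real ones per module
yarp name list                         # list every port registered with the name server
yarp rpc /tourManager/rpc              # open an interactive RPC session with a module
yarp read ... /crowdDetector/state:o   # stream data from a module's output port
```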
Minimum requirements:
- OS: Ubuntu 20.04+ or Docker support
- Memory: 8GB RAM
- Storage: 20GB available space
- Network: Internet connection for cloud services

Recommended setup:
- OS: Ubuntu 24.04
- Memory: 16GB+ RAM
- GPU: NVIDIA GPU with CUDA support (for vision modules)
- Network: Stable internet for speech and AI services

Hardware (for physical robot deployment):
- R1 humanoid robot platform
- RPLidar sensor
- RGB-D cameras
- Microphone array
- Speaker system
The system supports flexible configuration through:
- INI Configuration Files: Module-specific settings and parameters
- JSON Tour Definitions: Points of Interest, routes, and tour content (a sketch follows this list)
- Docker Environment Variables: Container-specific configuration
- CMake Build Options: Feature selection and dependency management
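Tour content itself lives in the JSON files named in the configuration below (e.g., `tours.json`). As a minimal sketch only, with field names that are illustrative assumptions rather than the repository's actual schema, a tour file could be created like this:

```bash
# Hypothetical schema: the keys below are assumptions, not the real format
cat > tours.json <<'EOF'
{
  "language": "en-US",
  "pois": [
    { "name": "entrance", "description": "Welcome to the lab!" },
    { "name": "workshop", "description": "Here is where the robots are built." }
  ]
}
EOF
```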
Example basic tour configuration:
```ini
[GENERAL]
name tourManager
period 1.0
robot_name r1
[TOURS]
tour_file tours.json
movements_file movements.json
language en-US
```

```bash
# Standard build
cmake .. -DCMAKE_BUILD_TYPE=Release
make -j$(nproc)
# With OpenCV modules (vision features)
cmake .. -DCMAKE_BUILD_TYPE=Release -DOPENCV_ON=ON
make -j$(nproc)
# With additional features
cmake .. -DENABLE_faceExpressionImage=ON -DSILERO_VAD=ON
make -j$(nproc)
```

The modular architecture allows for easy extension (a sketch follows this list):
- Add new auxiliary modules in `aux_modules/`
- Extend behavior trees in `skills/`
- Create custom applications in `app/`
- Define new interfaces in `interfaces/`
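As a sketch of adding a module, assuming `aux_modules/` follows the usual CMake `add_subdirectory` layout (an assumption worth verifying against the existing modules):

```bash
# Assumes the standard add_subdirectory pattern; check existing modules first
mkdir aux_modules/myNewModule
# add your sources and a CMakeLists.txt for the module, then register it:
echo 'add_subdirectory(myNewModule)' >> aux_modules/CMakeLists.txt
```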
To run the system:

```bash
# Start core services
yarpserver &
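# Optional: check that the name server is reachable before starting modules
yarp detect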
./tourManager --from config.ini &
./headSynchronizer &
# Start interaction modules
./eyeContactManager &
./faceExpression &
./speechProcessing/wakeWordDetection &
```

To run in simulation:

```bash
# Launch Gazebo simulation
gz sim world.sdf &
# Start robot applications
./tourManager --simulation &
yarpmanager
```

- Wiki: Comprehensive documentation and tutorials
- Docker Environments: Container setup and usage
- Auxiliary Modules: Module-specific documentation
- Issues: Report bugs and request features via GitHub Issues
Some applications don't behave nicely under yarpmanager (e.g., Gazebo, due to clock handling), so they are launched separately through .sh scripts.
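A minimal sketch of such a launcher, based on the simulation commands above (the script name is hypothetical):

```bash
#!/usr/bin/env bash
# run_sim.sh (hypothetical): launch Gazebo outside yarpmanager to sidestep
# the clock issues noted above, then start the tour manager in simulation mode
gz sim world.sdf &
./tourManager --simulation &
wait
```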
We welcome contributions! Please:
- Fork the repository
- Create a feature branch
- Follow existing code style and documentation standards
- Test your changes thoroughly
- Submit a pull request
This project is developed by the Human-Robot Interaction Lab at the Italian Institute of Technology (IIT). Please refer to the repository for licensing information.