Hack the North 2020 project. A smart assistant prototype that listens for a rhythm to wake up and reads hand gestures to understand commands. The prototype is designed for people who are mute and for anyone who prefers not to talk to a smart assistant.

whyVoice

Introduction

Smart assistants such as Google Assistant, Siri, and Alexa have become deeply integrated into our lives. With one simple command, people can wake up their smart assistant and ask it to do them a favor, from setting a timer to turning on the lights. However, because they are built on voice commands, current smart assistants have some limitations. One of them is that people who are mute cannot use a smart assistant without manually opening the app and typing the command. And when it is not convenient to speak, talking to a smart assistant is not a great experience either. That's why we built whyVoice: a way to use a smart assistant without saying anything.

Our system is expected to recognize a key rhythm (such as knocking on the desk in a specific pattern), record the hand gesture movement with its camera, and output a classification of the hand gesture video (such as "turn on the light"). The whole experience does not require the user to say a word.
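To make the wake-up step concrete, here is a minimal sketch of rhythm detection, assuming a simple energy-threshold heuristic over a mono audio buffer. It is illustrative only: whyVoice uses a trained keyword-spotting (KWS) model, and the function name, thresholds, and timing values below are not from this repository.

import numpy as np

# Illustrative heuristic only; the actual project uses a trained KWS model.
def detect_knock_rhythm(samples, sample_rate=16000, n_knocks=4,
                        frame_ms=20, energy_threshold=0.1, max_gap_s=1.0):
    """Return True if `samples` (mono float array) contains `n_knocks`
    loud onsets, each less than `max_gap_s` apart."""
    frame_len = int(sample_rate * frame_ms / 1000)
    n_frames = len(samples) // frame_len
    frames = samples[:n_frames * frame_len].reshape(n_frames, frame_len)
    energy = np.sqrt((frames ** 2).mean(axis=1))      # RMS energy per frame

    loud = energy > energy_threshold
    # An onset is a loud frame whose previous frame was quiet.
    onsets = np.flatnonzero(loud[1:] & ~loud[:-1]) + 1
    if len(onsets) < n_knocks:
        return False

    onset_times = onsets * frame_ms / 1000.0
    gaps = np.diff(onset_times[:n_knocks])
    return bool(np.all(gaps < max_gap_s))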

Usage

YouTube demo

Create the conda environment

git clone https://github.com/kevinsu628/whyVoice.git
conda env create -f environment.yml

Download hand gesture recognition model

Download using this link

## relocate the hand gesture model
mv /path_to_model src/hgr/model

Build

Option 1: If you would like to build on ROS:

Our system currently runs on ROS (Robot Operating System) Noetic on Ubuntu 20.04. Although the idea doesn't strictly require ROS (i.e. you could make the pieces work in a Jupyter notebook), we used it so that the system can run on real hardware one day.

  1. Follow this link to install ROS Noetic on Ubuntu 20.04.
  2. Modify the model paths (absolute paths suggested) in the KWS node and the HGR node.
  3. Launch the program:
## in terminal 1, start roscore:
roscore

## in terminal 2, build and launch the keyword spotting node and the hand gesture recognition node:
catkin_make
source devel/setup.bash
roslaunch whyvoice.launch

## optional: in terminal 3, view the real-time keyword spotting output:
rostopic echo /kws
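Instead of rostopic echo, the keyword-spotting output can also be consumed from your own node. Below is a minimal sketch, assuming the /kws topic carries std_msgs/String messages; the actual message type and payload published by whyVoice may differ.

#!/usr/bin/env python3
import rospy
from std_msgs.msg import String

def on_kws(msg):
    # Assumption: the KWS node publishes a string label (e.g. a wake flag)
    # whenever the key rhythm is detected.
    rospy.loginfo("keyword spotting output: %s", msg.data)

if __name__ == "__main__":
    rospy.init_node("kws_listener")
    rospy.Subscriber("/kws", String, on_kws)
    rospy.spin()  # keep the node alive and processing callbacks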

Note:

  1. Remember to run "source /opt/ros/noetic/setup.bash" every time you start a terminal to activate the ROS environment.

  2. Making the Python installed with ROS and the Python from Anaconda work together is tricky. You need to set PYTHONPATH so that packages from both conda and ROS can be found. If you hit any Python-path errors, try something like this:

export PYTHONPATH=/home/username/anaconda3/envs/whyvoice/lib/python3.6/site-packages:$PYTHONPATH:/usr/lib/python3/dist-packages/

Follow the rhythm used in the YouTube demo linked above to activate the system (knock on the desk 4 times). The system should wake up the camera and start hand gesture recognition. Try the following 4 gestures (a sketch of mapping them to commands follows the list):

  1. swipe to the right
  2. swipe to the left
  3. zoom in with your hand
  4. zoom out with your hand
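Whatever the HGR node predicts still has to be turned into a smart-assistant action. Here is a minimal sketch of such a mapping; the label strings and the actions are hypothetical placeholders, not the exact class names this model emits.

# Hypothetical gesture-label -> action mapping; adjust the keys to match
# the labels the hand gesture recognition model actually outputs.
GESTURE_ACTIONS = {
    "swipe_right": "next track",
    "swipe_left": "previous track",
    "zoom_in": "turn on the light",
    "zoom_out": "turn off the light",
}

def dispatch(gesture_label):
    action = GESTURE_ACTIONS.get(gesture_label)
    if action is None:
        print("unrecognized gesture:", gesture_label)
    else:
        print("executing action:", action)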

Option 2: If you would like to run pure Python on any system:

Stay tuned for the non-ROS version of this project; a new branch will be created for it later.
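Until that branch exists, a non-ROS version would roughly glue the same pieces together in a single process. The sketch below only outlines that flow; record_audio, rhythm_detected, record_video, classify_gesture, and dispatch are hypothetical placeholders that would wrap the microphone, the camera, and the two models, not modules in this repository.

import time

def run_whyvoice_loop(record_audio, rhythm_detected, record_video,
                      classify_gesture, dispatch, poll_s=0.5):
    """Outline of a non-ROS whyVoice pipeline built from placeholder callables."""
    while True:
        audio = record_audio(seconds=2)        # grab a short audio window
        if rhythm_detected(audio):             # key rhythm wakes the system
            video = record_video(seconds=3)    # capture the hand gesture clip
            gesture = classify_gesture(video)  # HGR model prediction
            dispatch(gesture)                  # map the gesture to a command
        time.sleep(poll_s)                     # avoid a busy loop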
