You must log in or register to comment.
Isnt this a solved problem? Cisco video conference things do this already, their cameras will swivel around to point at the active speaker.
I think this is different. Cisco’s audio triangulation: “Audio triangulation - The microphone array behind the fabric panel that is position behind the camera pictured above is able to accurately locate voices within the room. The microphones are only used for audio triangulation .”
the robot is different because it’s using binaural which uses the two “microphone” on our head (ears) and if doing it accurately, it should calculate how the sound is being received based on the pinnae and other shapes of the ear.
emphasis on the estimating