• CameronDev@programming.dev
    link
    fedilink
    English
    arrow-up
    2
    ·
    7 months ago

    Isnt this a solved problem? Cisco video conference things do this already, their cameras will swivel around to point at the active speaker.

    • venusaur@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      7 months ago

      I think this is different. Cisco’s audio triangulation: “Audio triangulation - The microphone array behind the fabric panel that is position behind the camera pictured above is able to accurately locate voices within the room. The microphones are only used for audio triangulation .”

      the robot is different because it’s using binaural which uses the two “microphone” on our head (ears) and if doing it accurately, it should calculate how the sound is being received based on the pinnae and other shapes of the ear.