Lyra (codec)

Lyra is a lossy audio codec developed by Google that is designed for compressing speech at very low bitrates. Unlike most other audio formats, it compresses data using a machine learning-based algorithm.

Lyra (codec)
Developed byGoogle
Initial release2021
Type of formatAudio codec
Free format?Yes

Features

The Lyra codec is designed to transmit speech in real-time when bandwidth is severely restricted, such as over slow or unreliable network connections.[1] It runs at a fixed bitrate 3 kbps and it is intended to provide better quality than codecs that use traditional waveform-based algorithms at similar bitrates.[2] Instead, compression is achieved via a machine learning algorithm that encodes the input with feature extraction, and then reconstructs an approximation of the original using a generative model.[1] This model was trained on thousands of hours of speech recorded in over 70 languages to function with various speakers.[2] Because generative models are more computationally complex than traditional codecs, a simple model that processes different frequency ranges in parallel is used to obtain acceptable performance.[3] Lyra operates on frames of 40 ms and Google claims it can run with 90 ms of latency on typical hardware.[1] Google's reference implementation is available for Android and Linux.[3]

Quality

Lyra performs significantly better than traditional codecs at similar bitrates.[1][3][4] Ian Buckley at MakeUseOf says, "It succeeds in creating almost eerie levels of audio reproduction with bitrates as low as 3 kbps." Google claims that is reproduces natural-sounding speech, and that Lyra at 3 kbps beats Opus at 8kps.[2] Tsahi Levent-Levi writes that Satin, Microsoft's AI-based codec, outperforms it at higher bitrates.[4]

History

Lyra was first announced in February 2021,[2] and in April, Google released the source code of their reference implementation.[1]

Support

Implementations

Google's implementation is available on Github under the Apache License.[1][5] Written in C++, it is optimized for 64-bit ARM but also runs on x86, on either Android or Linux.[3]

Applications

Google Duo uses Lyra to transmit sound for video chats when bandwidth is limited.[1][4]

References

  1. Buckley, Ian (2021-04-08). "Google Makes Its Lyra Low Bitrate Speech Codec Public". MakeUseOf. Retrieved 2022-07-21.
  2. "Lyra: A New Very Low-Bitrate Codec for Speech Compression". Google AI Blog. Retrieved 2022-07-21.
  3. "Google Duo uses a new codec for better call quality over poor connections". XDA. 2021-04-09. Retrieved 2022-07-21.
  4. Levent-Levi, Tsahi (2021-04-19). "Lyra, Satin and the future of voice codecs in WebRTC". BlogGeek.me. Retrieved 2022-07-21.
  5. Google (2021). "Lyra: A Very Low-Bitrate Codec for Speech Compression". GitHub. Retrieved 21 July 2022.

See also

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.