icon
icon
icon
icon
Upgrade
Upgrade

News /

Articles /

Nvidia Unveils Fugatto: The AI Audio Revolution Transforming Soundscapes

Word on the StreetTuesday, Nov 26, 2024 1:00 am ET
1min read

Nvidia has recently unveiled Fugatto, a pioneering generative AI audio model that promises to revolutionize the fields of music, film, and gaming by enabling unprecedented sound generation and manipulation. Dubbed the Foundational Generative Audio Transformer Opus, Fugatto allows users to generate music and sound effects through text prompts, modify existing audio files, and even create entirely new audio experiences.

This innovative model boasts the ability to transform a piano performance into a vocal rendition or alter speech accents and emotions in real time. By employing ComposableART technology, Fugatto can amalgamate disparate audio features encountered during training, resulting in novel sound configurations like a ‘barking trumpet’ or a ‘meowing saxophone’.

Fugatto's capabilities are powered by 2.5 billion parameters, a massive scale achieved through rigorous training on approximately 20 million audio samples from global open-source datasets. Developed by a diverse team spanning countries such as India, Brazil, China, Jordan, and South Korea, the model excels at handling multiple languages and accents.

Rafael Valle, Nvidia's Audio Research Manager, expressed, "Our aspirations were to build a model capable of understanding and producing sounds as intuitively as a human might." He further explained that this model could redefine how soundtracks and video game audio assets are tailored based on user interaction dynamics.

Despite its groundbreaking potential, Fugatto is not currently slated for public release. Vice President of Applied Deep Learning Research at Nvidia, Bryan Catanzaro, emphasized that while Fugatto could significantly enhance creativity in audio production, careful consideration is needed to mitigate misuse risks, such as creating misleading content or violating copyrights.

The introduction of Fugatto places Nvidia in direct competition with other tech giants like Meta Platforms, who recently introduced Movie Gen, an AI model designed to craft realistic video and audio clips from user inputs. As the landscape of AI audio tools expands, Nvidia's Fugatto stands out with its unique promise of creating never-before-heard sounds, although the pathway to its public application remains cautiously grounded.

Comments

Add a public comment...
Post
Refresh
Disclaimer: the above is a summary showing certain market information. AInvest is not responsible for any data errors, omissions or other information that may be displayed incorrectly as the data is derived from a third party source. Communications displaying market prices, data and other information available in this post are meant for informational purposes only and are not intended as an offer or solicitation for the purchase or sale of any security. Please do your own research when investing. All investments involve risk and the past performance of a security, or financial product does not guarantee future results or returns. Keep in mind that while diversification may help spread risk, it does not assure a profit, or protect against loss in a down market.
You Can Understand News Better with AI.
Whats the News impact on stock market?
Its impact is
fork
logo
AInvest
Aime Coplilot
Invest Smarter With AI Power.
Open App