The post Meta Introduces SAM Audio for Advanced Sound Isolation Using Multimodal Prompts appeared on BitcoinEthereumNews.com.

Meta Introduces SAM Audio for Advanced Sound Isolation Using Multimodal Prompts



Tony Kim
Dec 16, 2025 16:47

Meta’s SAM Audio separates sounds from audio mixtures using multimodal prompts: text, visual cues, or marked time spans. Meta reports state-of-the-art results across a range of audio separation tasks.

Meta AI has unveiled SAM Audio, a model that isolates individual sounds from complex audio mixtures using multimodal prompts. Users can separate audio components by typing text, pointing to visual cues, or marking time segments, according to Meta AI.

Revolutionizing Audio Processing

SAM Audio is built on Perception Encoder Audiovisual (PE-AV), the encoder that underpins its performance across audio separation tasks. The model brings to audio the prompt-driven approach of the Segment Anything Model (SAM), which reshaped object segmentation in images and videos. The aim is to make audio separation more accessible and practical by matching the way people naturally interact with sound.

Technical Innovations

SAM Audio accepts prompts in several modalities (text, visual, and temporal), giving users precise control over what is separated. It supports three prompting methods:

  • Text Prompting: Allows users to type specific sounds, like “dog barking,” to isolate them.
  • Visual Prompting: Enables clicking on objects or speakers in videos to isolate their audio.
  • Span Prompting: Lets users mark time segments to indicate the audio they want isolated.
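
To make the three prompt types concrete, here is a minimal Python sketch of how an application might represent them before handing them to a separation model. It is purely illustrative: `SeparationPrompt`, its fields, and `modalities` are hypothetical names chosen for this example and are not part of any API published by Meta.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class SeparationPrompt:
    """Hypothetical container for the three prompt types; not Meta's API."""
    text: Optional[str] = None                      # e.g. "dog barking"
    click: Optional[Tuple[float, int, int]] = None  # (time in seconds, x pixel, y pixel)
    spans: List[Tuple[float, float]] = field(default_factory=list)  # (start_s, end_s)

    def modalities(self) -> List[str]:
        """Return which prompt types are set, after checking the spans."""
        for start, end in self.spans:
            if not 0 <= start < end:
                raise ValueError(f"invalid span: ({start}, {end})")
        used = []
        if self.text:
            used.append("text")
        if self.click is not None:
            used.append("visual")
        if self.spans:
            used.append("span")
        if not used:
            raise ValueError("at least one prompt type is required")
        return used
```

For example, `SeparationPrompt(text="dog barking", spans=[(1.0, 2.5)]).modalities()` returns `["text", "span"]`, mirroring the mixed-modality prompting the article describes.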

Architecturally, the model is a flow-matching diffusion transformer: it encodes the audio mixture and the prompts into a shared representation, then generates two tracks, the target sound and the residual. The model is supported by a data engine that synthesizes large-scale, high-quality separation data, improving its applicability to real-world audio.
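
For readers unfamiliar with flow matching, the toy Python sketch below shows the general sampling idea: start from noise and integrate a velocity field from t=0 to t=1 until the sample lands on the desired output. It is not SAM Audio's code. The article does not describe the sampler, the step count, or the network interface, so `euler_sample` and `toy_velocity` are assumptions made for illustration, and the "network" here is a stand-in that already knows the target. In a real system, a trained transformer conditioned on the mixture and the prompts would supply the velocity.

```python
import random
from typing import Callable, List

Vector = List[float]


def euler_sample(velocity: Callable[[Vector, float], Vector],
                 x0: Vector, steps: int = 8) -> Vector:
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 with fixed Euler steps."""
    x = list(x0)
    dt = 1.0 / steps
    for k in range(steps):
        t = k * dt  # t never reaches 1.0, so the toy velocity stays finite
        v = velocity(x, t)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


def toy_velocity(target: Vector) -> Callable[[Vector, float], Vector]:
    """Stand-in for a trained network (assumption): the exact velocity of the
    straight-line path from the current point to a known target."""
    def v(x: Vector, t: float) -> Vector:
        return [(ti - xi) / (1.0 - t) for ti, xi in zip(target, x)]
    return v


random.seed(0)
target = [0.5, -0.25, 0.0, 0.75]                  # pretend "isolated sound" samples
noise = [random.gauss(0.0, 1.0) for _ in target]  # starting point at t=0
result = euler_sample(toy_velocity(target), noise)
```

Because the stand-in velocity points straight at the known target, the integration ends on it up to floating-point error. A learned velocity field only approximates this, which is why step count and solver choice matter in practice.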

PE-AV: The Engine Behind SAM Audio

PE-AV, built on Meta’s open-source Perception Encoder model, extends advanced computer vision capabilities to audio. It aligns video features with audio over time, which allows accurate separation of visually grounded sources and inference of off-screen events. This temporal alignment supports high-precision multimodal audio separation and is crucial for flexible, perceptually accurate results.

Benchmarking and Evaluation

Meta has introduced SAM Audio Judge and SAM Audio-Bench to evaluate and benchmark audio separation models. SAM Audio Judge offers a reference-free, objective metric for assessing audio separation quality, while SAM Audio-Bench provides a comprehensive benchmark covering speech, music, and general sound effects using multimodal prompts.

According to Meta, SAM Audio achieves state-of-the-art results across a range of audio separation tasks and outperforms previous models in both efficiency and quality. Challenges remain, such as separating similar audio events, but the model’s handling of mixed-modality prompts marks a significant advance in the field.

Looking Ahead

Meta envisions SAM Audio as a tool that lets creators, researchers, and developers explore new forms of expression and build new applications. Collaborations with partners such as Starkey and 2gether-International highlight the model’s potential for advancing accessibility. Meta presents SAM Audio as a step towards more inclusive and creative AI and towards future audio-aware technologies.

Image source: Shutterstock

Source: https://blockchain.news/news/meta-introduces-sam-audio-for-advanced-sound-isolation

