Willow Ventures

Meet ARGUS: A Scalable AI Framework for Training Large Recommender Transformers to One Billion Parameters | Insights by Willow Ventures


Yandex Unveils ARGUS: A Game-Changer in Recommender Systems

Yandex has launched ARGUS (AutoRegressive Generative User Sequential modeling), a cutting-edge transformer-based framework for recommender systems that boasts scalability of up to one billion parameters. This innovation positions Yandex alongside global tech giants like Google, Netflix, and Meta, who have successfully navigated the technical challenges of scaling recommender transformer models.

Breaking Technical Barriers in Recommender Systems

Recommender systems have historically confronted three significant challenges: short-term memory, limited scalability, and a lack of adaptability to evolving user behavior. Traditional architectures often truncate user history to a small window of recent interactions, overlooking valuable behavioral data. This results in a superficial understanding of user intent, missing out on long-term habits, shifting preferences, and seasonal trends. As item catalogs grow into the billions, these restrictive models fall short in precision and struggle to handle the computational demands of large-scale personalization, resulting in stale recommendations and diminished user engagement.

Only a few companies have successfully scaled recommender transformers beyond experimental phases. Google, Netflix, and Meta have made substantial investments in this domain, showcasing advancements through systems like YouTubeDNN, PinnerFormer, and Meta’s Generative Recommenders. With ARGUS, Yandex joins these leaders in demonstrating billion-parameter recommender models within operational environments. By analyzing entire behavioral timelines, ARGUS detects both overt and hidden correlations in user activities. This comprehensive approach allows the model to adapt to evolving user intents and cyclical patterns, such as proactively suggesting favorite tennis ball brands when summer approaches, rather than waiting for repeated user signals each year.


Technical Innovations Behind ARGUS

The ARGUS framework incorporates several groundbreaking advancements:

  • Dual-objective Pre-training: ARGUS splits autoregressive learning into two distinct tasks — next-item prediction and feedback prediction. This dual approach enhances the modeling of user preferences and mimics historical system behavior effectively.
  • Scalable Transformer Encoders: Models can be scaled from 3.2M to 1B parameters, with significant performance improvements across various metrics. Notably, at the billion-parameter scale, pairwise accuracy experiences a 2.66% uplift, indicating a scaling law in recommender transformers.
  • Extended Context Modeling: ARGUS can manage user histories containing up to 8,192 interactions in one go, which facilitates personalization based on months of behavior rather than just the most recent clicks.
  • Efficient Fine-tuning: Using a two-tower architecture, the model allows for offline computation of embeddings and scalable deployment, significantly lowering inference costs compared to prior target-aware or impression-level models.

Real-World Deployment and Measured Gains

ARGUS has been successfully deployed on Yandex’s music platform, serving millions of users. In production A/B testing, the system has achieved remarkable results including:

  • +2.26% increase in total listening time (TLT)
  • +6.37% improvement in indication of liking

These gains represent the largest quality enhancements recorded in the platform’s history for any deep learning–based recommender model.

Future Directions for ARGUS

Researchers at Yandex have ambitious plans to extend ARGUS towards real-time recommendation tasks, delve into feature engineering for pairwise ranking, and adapt the framework for high-cardinality domains like large e-commerce platforms and video services. The proven ability to scale user-sequence modeling with transformer architectures indicates a promising trajectory paralleling advancements in natural language processing.

Conclusion

With ARGUS, Yandex solidifies its position as a global leader in developing state-of-the-art recommender systems. By sharing its innovations openly, Yandex is enhancing personalization within its ecosystem while accelerating the advancement of recommendation technologies industry-wide.


Explore the full PAPER here. Special thanks to the Yandex team for their thought leadership and resources for this article.



Source link