System Architecture Overview
The X "For You" feed algorithm orchestrates a complex flow of data across multiple specialized services to deliver a personalized experience in real time. This guide walks through the system architecture by tracing the lifecycle of a single feed request.
The Recommendation Lifecycle
When a user opens their "For You" feed, the request moves through four distinct stages: Retrieval, Hydration, Ranking, and Filtering. All of this is managed by the Home Mixer, the central orchestration layer.
1. Initiation: The Home Mixer
The process begins at the HomeMixerServer. It acts as the gRPC gateway that receives the request and coordinates the downstream services.
```rust
// Example of the Home Mixer's entry point (simplified)
let service = HomeMixerServer::new().await;
// The Mixer calls the Candidate Pipeline to begin gathering posts
let scored_posts = service.get_scored_posts(request).await?;
```
2. Retrieval: Gathering Candidates
The system fetches candidate posts from two primary sources to ensure a balance between content from friends and new discovery.
- In-Network (Thunder): This service maintains a massive in-memory store of recent posts from the accounts a user follows. It consumes events from Kafka to keep its PostStore fresh.
- Out-of-Network (Phoenix Retrieval): This uses a "Two-Tower" model architecture. It encodes the user’s recent history into a vector and performs an approximate nearest neighbor (ANN) search against a global corpus of posts to find content the user might like from accounts they don't follow yet.
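The two-tower retrieval step can be sketched in a few lines: encode the user into a vector, then score it against precomputed post embeddings and keep the top matches. This is a minimal brute-force sketch; the real Phoenix retrieval uses learned encoder towers and an ANN index rather than an exact scan, and the array and function names below are illustrative assumptions.

```python
import numpy as np

# Hypothetical precomputed post embeddings (output of the "post tower").
rng = np.random.default_rng(0)
post_embeddings = rng.standard_normal((10_000, 64)).astype(np.float32)

def retrieve_out_of_network(user_vector: np.ndarray, k: int = 5) -> np.ndarray:
    """Return indices of the k posts most similar to the user vector.

    Production systems replace this exact scan with an ANN index so the
    search stays fast over a corpus of billions of posts.
    """
    scores = post_embeddings @ user_vector  # dot-product similarity
    return np.argpartition(-scores, k)[:k]  # top-k indices, unordered

# Hypothetical "user tower" output for the requesting user.
user_vector = rng.standard_normal(64).astype(np.float32)
candidates = retrieve_out_of_network(user_vector)
print(candidates.shape)  # (5,)
```

The key design point is that user and post encoders are decoupled: post vectors can be indexed offline, so only the user vector is computed at request time.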
3. Hydration: Preparing Data for the Transformer
Once candidates are gathered, they are "hydrated." This means the system attaches necessary metadata and features required for ranking.
- User Action Sequence: The system looks up the user's recent engagement history (likes, replies, shares).
- Embedding Lookup: For the Phoenix model to understand the content, post and author IDs are converted into high-dimensional embeddings.
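Conceptually, hydration turns a list of bare IDs into the tensors the ranker consumes. The sketch below uses in-process dictionaries as stand-ins for the real feature stores; `embedding_table`, `user_actions`, and `hydrate` are hypothetical names, not the actual pipeline API.

```python
import numpy as np

# Hypothetical feature stores; production hydration reads these from
# dedicated services, not in-process dictionaries.
EMBED_DIM = 8
embedding_table = {post_id: np.full(EMBED_DIM, float(post_id)) for post_id in range(100)}
user_actions = {42: [("like", 7), ("reply", 13), ("like", 99)]}  # (action, post_id)

def hydrate(user_id: int, candidate_ids: list) -> dict:
    """Attach the features the ranker needs to a batch of candidates."""
    history_ids = [post_id for _, post_id in user_actions.get(user_id, [])]
    return {
        "history_embeddings": np.stack([embedding_table[i] for i in history_ids]),
        "candidate_embeddings": np.stack([embedding_table[i] for i in candidate_ids]),
    }

features = hydrate(42, [1, 2, 3])
print(features["candidate_embeddings"].shape)  # (3, 8)
```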
4. Ranking: The Phoenix Transformer
This is the core of the system. All candidates (both in-network and out-of-network) are sent to Phoenix, a Grok-based transformer model. Unlike traditional systems with thousands of hand-tuned rules, Phoenix uses a transformer architecture to predict the probability of engagement.
The ranking process follows this logic in the model:
- Reduce Hashes: User, post, and author hashes are projected into a unified embedding space using block_user_reduce and block_history_reduce.
- Apply Attention Mask: A specialized make_recsys_attn_mask is used. This allows candidates to "attend" to the user's history but prevents them from seeing other candidates, ensuring each post is evaluated independently relative to the user's preferences.
- Compute Scores: The model outputs logits representing various engagement types (e.g., probability of a Like vs. probability of a Reply).
```python
# The transformer evaluates candidates against user history
# From phoenix/recsys_model.py
output = model(
    user_embeddings=user_emb,
    history_embeddings=hist_emb,
    candidate_embeddings=cand_emb,
)
```
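The effect of the attention mask described above can be illustrated with a toy boolean mask over a sequence laid out as `[history tokens..., candidate tokens...]`. This is an illustrative reconstruction of the idea, not the actual make_recsys_attn_mask implementation.

```python
import numpy as np

def toy_recsys_attn_mask(history_len: int, num_candidates: int) -> np.ndarray:
    """Return a boolean mask where entry (i, j) means i may attend to j.

    Every position may attend to the user's history, and each candidate
    may attend to itself, but candidates never see other candidates, so
    each post is scored independently given the user's preferences.
    """
    total = history_len + num_candidates
    mask = np.zeros((total, total), dtype=bool)
    mask[:, :history_len] = True                 # everyone sees the history
    idx = np.arange(history_len, total)
    mask[idx, idx] = True                        # candidates see themselves
    return mask

mask = toy_recsys_attn_mask(history_len=3, num_candidates=2)
print(mask.astype(int))
```

Because no candidate row attends to another candidate column, adding or removing one candidate from the batch cannot change another candidate's score.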
5. Post-Processing: Filtering and Delivery
The final stage occurs back in the Candidate Pipeline. Before the posts are sent to the user's device, the system applies final heuristics:
- Filtering: Removes posts the user has already seen, blocked content, or low-quality/spam posts.
- Mixing: Ensures a healthy balance between different content types (e.g., images vs. text) and sources (In-Network vs. Out-of-Network).
- Side Effects: Logs the results for future model training.
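The filtering and mixing heuristics above can be sketched as a small post-processing pass: drop already-seen posts, then interleave the two candidate sources. The `postprocess` function and its `(post_id, source)` input shape are assumptions for illustration; the real Candidate Pipeline applies many more filters (blocks, spam) and richer balance rules.

```python
from itertools import zip_longest

def postprocess(ranked, seen_ids):
    """Apply final heuristics to a ranked candidate list.

    `ranked` is a list of (post_id, source) pairs, best first, where
    source is "in" (In-Network) or "out" (Out-of-Network).
    """
    # Filtering: never re-show a post the user has already seen.
    fresh = [(pid, src) for pid, src in ranked if pid not in seen_ids]
    # Mixing: alternate sources so neither dominates the feed.
    in_net = [p for p in fresh if p[1] == "in"]
    out_net = [p for p in fresh if p[1] == "out"]
    mixed = []
    for a, b in zip_longest(in_net, out_net):
        if a is not None:
            mixed.append(a)
        if b is not None:
            mixed.append(b)
    return [pid for pid, _ in mixed]

feed = postprocess(
    ranked=[(1, "in"), (2, "in"), (3, "out"), (4, "in"), (5, "out")],
    seen_ids={2},
)
print(feed)  # [1, 3, 4, 5]
```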
Component Overview
| Component | Language | Role |
| :--- | :--- | :--- |
| Home Mixer | Rust | Orchestrates the request/response flow via gRPC. |
| Thunder | Rust | Provides low-latency retrieval of "In-Network" posts from memory. |
| Phoenix | Python/JAX | The Grok-based transformer that ranks posts. |
| Candidate Pipeline | Rust | Shared library for scoring, filtering, and hydrating posts. |
By separating retrieval (finding what to show) from ranking (deciding how to order it), the architecture maintains high throughput while utilizing heavy-duty machine learning models to maximize user relevance.