I spent a weekend digging through the open-sourced X (formerly Twitter) algorithm. What I found was both fascinating and practical: a Grok-based transformer model that predicts your behavior with surprising sophistication.
This isn’t speculation. This is what the code actually does.
The Architecture: Phoenix, Thunder, and Home Mixer
The “For You” feed is powered by three main systems:
1. Phoenix: The Brain
Phoenix is a Grok-based transformer model (yes, the same transformer architecture family as xAI’s Grok). It handles two critical functions:
- Retrieval: Finding relevant posts from millions of candidates using a two-tower model
- Ranking: Scoring posts by predicting engagement probabilities
The ranking model uses a clever technique called candidate isolation: posts can’t “see” each other during scoring, only the user’s context. This ensures consistent, cacheable scores.
2. Thunder: The In-Network Source
Thunder is an in-memory post store that tracks recent posts from accounts you follow. It’s optimized for sub-millisecond lookups.
Key insight: In-network posts get preference. When you follow someone, their posts are more likely to appear in your feed than posts from strangers with similar engagement predictions.
3. Home Mixer: The Orchestrator
This is the glue. It:
- Fetches your engagement history
- Retrieves candidates from both Thunder (in-network) and Phoenix (out-of-network)
- Hydrates posts with metadata
- Filters ineligible content
- Runs the scoring pipeline
- Selects top candidates
- Returns your ranked feed
The Scoring Formula
Here’s the core of the scoring logic: the final score is a weighted sum of the model’s predicted action probabilities.
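A minimal sketch of that weighted sum. The weight values below are hypothetical placeholders, since the real weights are excluded from the open source release; the signal names match the tables later in this post.

```python
# Hypothetical weights: the real values are excluded from the release.
# Positive actions add to the score; negative actions subtract.
WEIGHTS = {
    "favorite_score": 1.0,
    "reply_score": 13.5,
    "retweet_score": 2.0,
    "block_author_score": -100.0,
    "report_score": -300.0,
}

def score_post(predictions):
    """Weighted sum of the model's per-action probabilities."""
    return sum(WEIGHTS[name] * p for name, p in predictions.items())

# Even a small predicted block probability can outweigh a near-certain like:
safe = score_post({"favorite_score": 0.9})
risky = score_post({"favorite_score": 0.9, "block_author_score": 0.05})
```

With these placeholder weights, `safe` is 0.9 while `risky` is negative: a 5% chance of a block wipes out a 90% chance of a like.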
The model predicts the probability of each action, then multiplies by a weight. Positive actions add to your score. Negative actions subtract.
This is crucial: one block can hurt more than ten likes help.
The 19 Signals That Determine Your Reach
The Phoenix model predicts probabilities for 19 distinct user actions:
Positive Signals
| Signal | What It Means | Code Reference |
|---|---|---|
favorite_score | P(user will like) | ServerTweetFav |
reply_score | P(user will reply) | ServerTweetReply |
retweet_score | P(user will repost) | ServerTweetRetweet |
quote_score | P(user will quote tweet) | ServerTweetQuote |
click_score | P(user will click post) | ClientTweetClick |
profile_click_score | P(user will click author profile) | ClientTweetClickProfile |
photo_expand_score | P(user will expand image) | ClientTweetPhotoExpand |
vqv_score | P(view counts as a video quality view) | ClientTweetVideoQualityView
share_score | P(user will share) | ClientTweetShare |
share_via_dm_score | P(user will DM post) | ClientTweetClickSendViaDirectMessage |
share_via_copy_link_score | P(user will copy link) | ClientTweetShareViaCopyLink |
dwell_score | P(user will dwell on post) | ClientTweetRecapDwelled |
quoted_click_score | P(user will click quoted tweet) | ClientQuotedTweetClick |
follow_author_score | P(user will follow author) | ClientTweetFollowAuthor |
dwell_time | Expected dwell duration (continuous) | ContinuousActionName::DwellTime |
Negative Signals
| Signal | What It Means | Code Reference |
|---|---|---|
not_interested_score | P(user clicks “not interested”) | ClientTweetNotInterestedIn |
block_author_score | P(user will block author) | ClientTweetBlockAuthor |
mute_author_score | P(user will mute author) | ClientTweetMuteAuthor |
report_score | P(user will report post) | ClientTweetReport |
The Author Diversity Penalty
Here’s something most people don’t know: if you post multiple times, your posts compete against each other.
The code sorts an author’s posts by score. The first (highest-scoring) post gets full weight; each subsequent post is multiplied by an exponentially decaying penalty.
The exact decay factor and floor are excluded from the open source release, but the exponential decay pattern is clear.
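A sketch of that pattern, using made-up decay and floor constants since the real values are excluded:

```python
# Hypothetical author-diversity penalty: the author's best post keeps
# full weight; each later post is scaled by an exponentially decaying
# multiplier with a floor. DECAY and FLOOR are placeholder values.
DECAY = 0.5
FLOOR = 0.1

def apply_author_diversity(scores):
    """Scores for one author's posts, penalized best-first."""
    ranked = sorted(scores, reverse=True)
    return [s * max(DECAY ** i, FLOOR) for i, s in enumerate(ranked)]
```

With these placeholders, an author’s posts scored [10, 8, 6] become [10.0, 4.0, 1.5]: the third post is worth a quarter of its raw score.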
Implication: Quality beats quantity. One excellent post outperforms five mediocre ones.
The Filtering Pipeline
Before scoring, posts go through a gauntlet of filters:
| Filter | What It Does |
|---|---|
DropDuplicatesFilter | Removes duplicate post IDs |
AgeFilter | Removes posts older than threshold (~48h) |
SelfTweetFilter | Removes your own posts from your feed |
RetweetDeduplicationFilter | Dedupes reposts of same content |
PreviouslySeenPostsFilter | Removes posts you’ve already seen |
PreviouslyServedPostsFilter | Removes posts served in current session |
MutedKeywordFilter | Removes posts with your muted keywords |
AuthorSocialgraphFilter | Removes posts from blocked/muted accounts |
IneligibleSubscriptionFilter | Removes paywalled content you can’t access |
VFFilter | Post-selection visibility filter (spam, violence, etc.) |
The MutedKeywordFilter is worth noting. If your post contains keywords that many users have muted, it’ll be filtered out for those users regardless of its score.
Video Quality Views: The Duration Threshold
Videos only contribute to VQV (Video Quality View) scoring if they meet a minimum duration threshold.
The exact MIN_VIDEO_DURATION_MS is excluded, but industry standards suggest 2-3 seconds minimum. Videos shorter than this get zero weight for the video view signal.
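The gate itself is simple. A sketch with a placeholder threshold, since the real MIN_VIDEO_DURATION_MS is excluded:

```python
MIN_VIDEO_DURATION_MS = 2000  # placeholder; real constant excluded from release

def gated_vqv_score(video_duration_ms, vqv_score):
    """Videos below the duration threshold contribute nothing to VQV."""
    return vqv_score if video_duration_ms >= MIN_VIDEO_DURATION_MS else 0.0
```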
Optimal Video Lengths
| Duration | Best For | Notes |
|---|---|---|
| 2-15 seconds | Loops, memes, quick hits | Fast engagement, easy shares |
| 30-60 seconds | Tips, insights, reactions | Sweet spot for dwell time |
| 1-2 minutes | Tutorials, stories, threads | Maximum dwell if engaging |
| 2+ minutes | Educational deep dives | Only if content is compelling |
Video Best Practices
| ✅ Do | ❌ Don’t |
|---|---|
| Hook in first 1-2 seconds | Slow intros |
| Add captions | Sound-only content |
| Native upload | YouTube/TikTok links |
| Strong thumbnail | Boring first frame |
Dwell Time: The Underrated Signal
The algorithm tracks two distinct dwell signals:
- dwell_score: Binary. Did they stop scrolling?
- dwell_time: Continuous. How long did they spend?
The dwell time is treated as a continuous value, not a probability.
This means longer content that holds attention is genuinely rewarded, not just content that stops the scroll.
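One way to picture the two dwell signals combining. The 10-second normalization cap and the unit weights here are assumptions for illustration, not values from the codebase:

```python
def dwell_contribution(dwell_score, dwell_time_ms,
                       w_score=1.0, w_time=1.0):
    """Combine P(dwell) with expected dwell duration.

    The 10-second cap and unit weights are placeholder assumptions.
    """
    normalized_time = min(dwell_time_ms / 10_000.0, 1.0)
    return w_score * dwell_score + w_time * normalized_time
```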
Dwell Time by Duration
| Duration | Signal Strength | What It Means |
|---|---|---|
| < 500ms | None | Scrolled past |
| 500ms - 2s | Weak | Brief pause |
| 2-5 seconds | Good | Genuine interest |
| 5+ seconds | Great | Strong engagement |
| 10+ seconds | Best | Deep engagement |
How to Maximize Dwell Time
- Write longer, multi-paragraph posts that take time to read
- Use storytelling that keeps people engaged
- Create threads with valuable content across multiple posts
- Add multiple images to swipe through
- Include detailed visuals people will examine
The Out-of-Network Penalty
In-network posts (from accounts you follow) get built-in preference over out-of-network posts.
Out-of-network content is multiplied by OON_WEIGHT_FACTOR (a value less than 1). This means even a viral post from a stranger has to overcome a built-in handicap compared to a post from someone you follow.
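In code terms, the down-weighting amounts to something like this; the factor value here is a placeholder, since the real OON_WEIGHT_FACTOR is excluded:

```python
OON_WEIGHT_FACTOR = 0.75  # placeholder; the real value (< 1) is excluded

def adjusted_score(raw_score, in_network):
    """Out-of-network candidates are scaled down before final ranking."""
    return raw_score if in_network else raw_score * OON_WEIGHT_FACTOR
```

Note that with this factor, a stranger’s post scoring 1.2 (adjusted to 0.9) still loses to a followed account’s post scoring 1.0.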
Implication: Growing your follower count has compounding benefits beyond vanity. Your posts get an algorithmic boost with your followers.
The ML Model: Grok-Based Transformer
The ranking model is a transformer architecture ported from xAI’s Grok-1 release. Here’s what makes it interesting:
Hash-Based Embeddings
Instead of looking up users and posts in giant embedding tables, the model derives embeddings by applying multiple hash functions to the raw ID.
This allows the model to handle any user or post ID without maintaining massive lookup tables.
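A minimal sketch of the hashing trick: each ID is mapped by several independent hash functions into a fixed-size table, and the resulting rows would then be summed into one embedding. The table size and hash count here are illustrative, not the real configuration.

```python
import hashlib

NUM_HASHES = 4
TABLE_SIZE = 1 << 20  # fixed table size, regardless of how many IDs exist

def hash_buckets(entity_id):
    """Map an arbitrary user/post ID to NUM_HASHES rows in the table."""
    buckets = []
    for seed in range(NUM_HASHES):
        # Seeded SHA-256 stands in for the independent hash functions.
        digest = hashlib.sha256(f"{seed}:{entity_id}".encode()).digest()
        buckets.append(int.from_bytes(digest[:8], "big") % TABLE_SIZE)
    return buckets
```

Because collisions spread differently under each hash, summing the rows gives every ID a near-unique combined embedding without any per-ID storage.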
The Input Structure
The model takes three components:
- User embedding: Who is viewing
- History embeddings: What they’ve engaged with recently
- Candidate embeddings: Posts to be scored
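In pseudocode, one scoring request concatenates those three components into a single sequence. All dimensions below are illustrative:

```python
D = 64             # embedding dimension (illustrative)
HISTORY_LEN = 8    # recent engagements (illustrative)
NUM_CANDIDATES = 4

user = [[0.0] * D]                                       # 1 user token
history = [[0.0] * D for _ in range(HISTORY_LEN)]        # engagement history
candidates = [[0.0] * D for _ in range(NUM_CANDIDATES)]  # posts to score

# One sequence per request: [user | history | candidates]
sequence = user + history + candidates
```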
Candidate Isolation Attention Mask
Here’s the clever part: during attention, candidates can see the user and history tokens, but cannot see each other.
This ensures that a post’s score doesn’t depend on which other posts happen to be in the same batch. Scores are consistent and can be cached.
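A sketch of how such a mask could be constructed (True means attention is allowed; sizes are illustrative):

```python
def candidate_isolation_mask(num_context, num_candidates):
    """Context = user + history tokens; candidates see the context and
    themselves, never each other."""
    n = num_context + num_candidates
    mask = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(num_context):
            mask[i][j] = True        # every token attends to the context
    for i in range(num_context, n):
        mask[i][i] = True            # each candidate attends only to itself
    return mask
```

Because no candidate’s representation depends on the other candidates in the batch, a post’s score can be computed once and cached.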
Practical Takeaways
Based on the code, here’s what actually moves the needle:
What to Optimize For
- Replies: High-weight positive signal. Ask questions. Invite discussion.
- Shares via DM: Surprisingly high weight. Create “send this to someone” content.
- Profile clicks → Follows: The follow_author_score directly contributes to ranking.
- Dwell time: Write substantive content that takes time to read.
- Video quality views: Make videos 2+ seconds minimum, hook immediately.
What to Avoid
- Blocks: Severe negative weight. Don’t be hostile.
- Reports: Severe negative weight. Stay within guidelines.
- Mutes: High negative weight. Don’t be spammy.
- Excessive posting: The author diversity scorer will penalize your 3rd, 4th, and 5th posts.
Content That Triggers Negative Signals
| ❌ Avoid This | Why It Hurts |
|---|---|
| Engagement bait (“Like if you agree!”) | Triggers “not interested” |
| Rage bait (intentionally provocative) | Triggers blocks and mutes |
| Spam patterns (same content repeatedly) | Triggers mutes |
| Excessive self-promotion | Triggers mutes and unfollows |
| Misleading headlines (clickbait) | Triggers “not interested” |
| Hostile replies (aggressive arguing) | Triggers blocks |
| Posting 5+ times per day | Diversity penalty + mutes |
These behaviors lead to mutes and blocks, which damage your reach severely, often by more than the positive engagement you might gain.
Optimal Posting Cadence
Given the ~48-hour post retention window and the author diversity penalty:
| Posts/Day | Recommendation |
|---|---|
| 1 | Best reach per post |
| 2 | Good; space 12+ hours apart |
| 3 | Acceptable; space 8+ hours |
| 4+ | Diminishing returns |
The Algorithm’s Core Question
All of this complexity reduces to one question the model is trying to answer:
“Will this specific user engage positively with this content?”
It’s not asking what’s objectively “good.” It’s predicting your personal behavior based on your history.
This is why generic growth hacks have diminishing returns. The algorithm is personalized. What works for one audience may not work for another.
The sustainable strategy is straightforward: create content that genuinely resonates with your specific audience, and avoid behaviors that trigger negative signals.
TL;DR Cheat Sheet
✅ Do This
| Action | Impact |
|---|---|
| Ask questions (triggers replies) | 🔥🔥🔥 |
| Create shareable content | 🔥🔥🔥 |
| Write longer posts (dwell time) | 🔥🔥 |
| 1-2 quality posts per day | 🔥🔥 |
| Space posts 10-12 hours apart | 🔥🔥 |
| Respond to replies quickly | 🔥🔥 |
| Use multiple images | 🔥 |
| Videos 30s-2min with strong hooks | 🔥 |
❌ Avoid This
| Action | Impact |
|---|---|
| Getting blocked | 📉📉📉 |
| Getting reported | 📉📉📉 |
| Posting 5+ times in 24 hours | 📉📉 |
| Spammy/repetitive content | 📉📉 |
| Engagement bait phrases | 📉 |
| Hostile replies | 📉 |
Technical Notes
For those who want to dig deeper:
- Model architecture: Transformer with special attention masking for candidate isolation
- Inference: Predictions made per-user, not globally
- Serving stack: Rust-based candidate pipeline (home-mixer/) calling Python ML models (phoenix/)
- In-network source: thunder/, a Redis-like in-memory post store with Kafka ingestion
- Framework: JAX + Haiku for the ML components
The exact weight values for each signal are excluded from the open source release (noted as “Excluded from open source release for security reasons” in the code), but the relative importance is clear from the architecture.
Related
- What Transformers Taught Me About Attention: Understanding the attention mechanism that powers this algorithm
Further Reading
- X Algorithm Repository: The source code
- Attention Is All You Need: The transformer paper
- Grok-1 Open Source: The base transformer architecture
The irony isn’t lost on me: I’m using insights about the algorithm to write content optimized for the algorithm. But at least now you know how the game works.
Changelog
- 2026-01-20: Initial comprehensive analysis of open-sourced X algorithm
- 2026-01-29: Added frontmatter metadata, minor formatting improvements