I can probably name 10 crypto projects going after decentralized AI inference from completely different angles rn. This lane is attracting some of the best AI dev talent in crypto rn.


The fight is getting more complex with the race for hardware, trust models, latency profiles, and who can turn raw compute into usable inference without lighting money on fire.
I tried to map out some of the projects in the game so you can see it more clearly.
[1] @darkbloomai
@EigenLabs is trying to turn idle Apple Silicon Macs into a privacy-first inference network.
Millions of M-series Macs already exist, marginal cost is mostly electricity, and Apple Silicon has unified memory up to 64–512GB with huge bandwidth.
So they want the spare Mac sitting on someone's desk to become part of the AI infra layer.
Users send inference requests, Eigen-operated coordinators route them, providers run the model on eligible Macs, and the provider supposedly can't see the prompt or output.
– top provider earns around $6 and the fifth around $2
– 30-day earnings were closer to ~$6 total against calculator projections of $280–600/month
[2] @nosana_ai
Solana-based inference network focused on AI jobs, scheduling, GPU hosts, and developers who want cheaper inference without thinking too much about infra.
– 50K+ registered GPU hosts
– 600 daily active nodes across 60+ countries
They give devs a cheaper inference lane and let Solana coordinate the marketplace.
[3] @rendernetwork
They already had the creative GPU network, then slowly expanded into AI compute and inference.
– 5,600+ GPU nodes
– 24.3M frames rendered in 2025
– AI already accounts for around 35–40% of volume
– Dispersed GPU around $0.69/hr
Rendering and inference are not the same workload, but Render already knows how to coordinate distributed GPU demand, payments, supply reputation, and creator workflows.
[4] @akashnet
Kubernetes-native, reverse auctions, rent compute via SDL, flexible enough for many workloads.
Akash is real, mature, cheap, useful, but not perfectly optimized for inference.
It can be a cost-relief valve, burst capacity, batch jobs, and self-hosted infra. But for low-latency inference UX, generalized cloud has to compete with specialized routers.
[5] @ionet
100K+ registered GPUs, H100 pricing around $1.49–2.20/hr versus much higher hyperscaler pricing, Ray-style orchestration, and now confidential compute with Intel TDX-enabled H100/H200/B200 tiers.
is going after scale and enterprise-grade positioning.
[6] $Tao subnets (SN64 + SN4)
Chutes has ~4,400 H100-equivalent supply, 9.1T total tokens served, 50B+ tokens/day at peak, and is apparently one of the top OpenRouter providers.
Targon is leaning into confidential inference with Intel TDX + NVIDIA confidential computing, 1,500+ H200s, and a claimed ~$10M ARR.
Latency is still the wall. Most decentralized networks are nowhere close to centralized providers for real-time chat.
But they don't need to win every workload.
Batch inference, async agents, code tasks, image/video generation, offline research, background agent loops, all of these can tolerate worse latency if cost and privacy are better.
That's why I believe the bigger the AI market gets, the more room there is for crypto projects to carve out their own lane.
post-image
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments