The Open Access AI Cloud
This week, we’re diving into Hyperbolic, an open-access AI cloud making waves.
Hyperbolic’s bold mission is to democratize AI by offering affordable compute power for inference.
But before that, let’s kick things off with what we found most interesting about Hyperbolic…
Hyperbolic is pushing boundaries by addressing one of the toughest challenges in AI: verifying that an output truly comes from the specified AI model
This issue is especially tricky with centralized, closed-source providers like OpenAI. When you ask GPT-4 for an output, how can you be sure you’re not getting shortchanged—say, by OpenAI running a cheaper GPT-3.5 model instead (at 1/20 of the price per token)?
Currently, this assurance relies on reputation, but Hyperbolic believes this should be handled in a trustless, decentralized way.
There are a few ways to do this currently:
However, both have limitations:
Hyperbolic aims to overcome these drawbacks with its Proof of Sampling (PoSP) protocol and Sampling Machine Learning (SpML). SpML leverages sampling and game theory to encourage honest behaviour without constant oversight.
It’s based on a game-theoretic concept known as the pure strategy Nash Equilibrium, where all participants have a clear incentive to act honestly because the costs of cheating outweigh the potential gains.
The easiest mental model for this is to think of it as a public bus ticketing system.
Ticket inspectors conduct random checks only, so you might think passengers would frequently risk not buying a ticket. Surprisingly, they don’t because the penalties are high enough to deter cheating. As long as the penalty far exceeds the ticket's cost, honesty prevails.
Hyperbolic’s SpML uses economic incentives to address the limitations of current verification mechanisms like OpML and zkML. It provides a balance of speed and security without a heavy computational burden.
The caveat? It assumes that everyone acts rationally, which may not always be the case.
If SpML works well in practice, it will be a game-changer for decentralized AI applications, making trustless, verified inference a reality.
Training AI is expensive.
Electricity and access to compute are some of the biggest costs to companies and startups. The cost of the computational power required to train models doubles every nine months.
GPT-3 costs around $4M (2020), while GPT-4 (2023) costs a staggering $190M of computing to train.
Only well-resourced organizations can keep up at this rate. Smaller participants and retail enthusiasts are priced out. One Stanford postdoc was blocked from conducting his research because he couldn’t afford the thousands of GPUs needed.
One major challenge in decentralized compute networks is managing heterogeneous hardware—not just top-tier Nvidia chips but a broad spectrum of GPUs.
Hyperbolic’s Decentralized Operating System is the core of its compute network. It will seamlessly cluster resources with built-in auto-scaling and fault tolerance.
Hyperbolic’s breakthrough is in how it handles this complexity.
Other marketplaces may offer decentralized GPUs, but they often lack the sophisticated optimization Hyperbolic delivers, placing the burden of performance tuning on the user.
Hyperbolic streamlines this with an API that provides access to AI models optimized for various hardware, making global compute resources more accessible.
Hyperbolic released its limited alpha version of its GPU marketplace on August 15th, allowing 100 waitlisted members to trial GPU rental. You can sign up for their waitlist here.
The next component of Hyperbolic’s AI ecosystem is the AI services layer, which offers capabilities like inference, model training, model evaluation, and Retrieval Augmented Generation (RAG).
Within the Hyperbolic app, you can easily run top open-source models like Llama 3.1 405B and Hermes 3 70B. To fine-tune outputs, you can adjust hyperparameters such as max tokens, temperature, and top P.
On August 14th, we processed a RECORD-BREAKING 130 million LLM tokens in a single day—the HIGHEST VOLUME we've ever recorded!
We're proud to be the sole provider of the Llama 3.1 405B Base model and one of the few offering the Llama 3.1 405 Instruct bf16 model currently… x.com/i/web/status/1…
— Hyperbolic (@hyperbolic_labs) 7:06 PM • Aug 15, 2024
This setup, accessible via a straightforward API, simplifies the process for developers, making it as easy as using the OpenAI API but at a fraction of the cost.
Hyperbolic’s platform opens the door to innovative AI applications, including:
• Revenue Sharing for AI Agents: Tokenizing ownership of AI agents to redistribute revenue.
• AI-Powered DAOs: Leveraging AI for governance decisions.
• Fractionalized GPU Ownership: Enabling users to own and trade fractions of GPUs.
At the core of Hyperbolic’s infrastructure is its blockchain, underpinning the orchestration, services, and verification layers. The blockchain handles settlement and governance for Hyperbolic’s open-access AI cloud. It also powers the arbitration and verification mechanisms of the PoSP technology.
While details on the blockchain aspect are still sparse, you can expect Hyperbolic to reveal more about this soon.
Hyperbolic is still in the testnet phase. They raised $7m in a seed round led by Polychain Capital and Lightspeed Faction.
Interestingly, Hyperbolic is the sole provider of the Llama 3.1 405B base model.
Base models are the initial pre-trained versions of LLMs that haven’t undergone fine-tuning or reinforcement learning with human feedback (RLHF). It offers some advantages:
Beware: Base models are wild, though!
I tried to write a prompt to show how base LLMs differ from the RLHF-tuned ones everyone knows, and I think this gives a bit of the flavor.
A message from Llama 3.1 405B (base), on whether it’s useful to talk to base LLMs:
— Riley Goodside (@goodside) 3:08 AM • Aug 22, 2024
Users can sign up for the waitlist for GPU rental and supply here.
Hyperbolic also runs community missions with tasks that reward users with discord roles. You can join their discord here.
Dr. Jasper (Yue) Zhang is the CEO and co-founder of Hyperbolic Labs. He was previously a senior blockchain researcher at Ava Labs and a quantitative researcher at Citadel Securities. He completed his Ph.D. in math at UC Berkeley in two years and won gold medals at the Alibaba Global Math Competition and the Chinese Mathematical Olympiad.
Dr. Yuchen Jin is the CTO and co-founder of Hyperbolic Labs. He holds a Ph.D. in CS systems and networking from the University of Washington. He previously worked at OctoML, a company that delivers infrastructure to run, tune, and scale generative AI applications.
You can check out the rest of the team here.
What a week for @hyperbolic_labs!
> Still shocked by how we increased DAU and WAU by 1000+% over the past 2 weeks. Excited to see a lot of AI researchers and developers calling our API and spreading good words.
> We hosted Llama 3.1 405B unquantized model on the same day of its… x.com/i/web/status/1… — Jasper Zhang 张钺🤘🌪️ (@zjasper666) 7:00 PM • Aug 3, 2024
Cheers,
Teng Yan & Joshua
This newsletter is intended solely for educational purposes and does not constitute financial advice. It is not an endorsement to buy or sell assets or make financial decisions. Always conduct your own research and exercise caution when making investment choices.