Rafay Systems Transforms GPU Providers Into AI Factories By Empowering Them to Monetize Token-Metered Access to AI Models
PR Newswire
SUNNYVALE, Calif., April 2, 2026
New capabilities in the Rafay Platform provide AI factories and neocloud operators with the metering and monetization layer needed to offer token-based AI services to enterprises and retail users
SUNNYVALE, Calif., April 2, 2026 /PRNewswire/ -- Rafay Systems, a leader in infrastructure orchestration for AI and cloud-native workloads, today announced the general availability of Token Factory, a suite of capabilities in the Rafay Platform that deliver token-based access to AI models and services. Token-based access to models and other AI services has quickly become a foundational requirement in the AI industry and is what sets apart AI factory operators from commodity GPU providers.
Rafay's Token Factory gives AI factory operators and neoclouds the metering, pricing and access-control capabilities needed to monetize token-based access to AI models running on accelerated computing infrastructure. With Token Factory, AI factory operators can immediately deliver token-metered access to AI models as a service through developer-friendly consumption workflows without needing to build the orchestration and monetization stack from scratch.
This general availability announcement arrives as AI consumption is undergoing a fundamental shift. Enterprises and developers are increasingly accessing AI models through agentic frameworks like OpenClaw, the open-source AI agent platform that executes multi-step workflows, calls external tools, and runs continuously to complete real tasks. NVIDIA's NemoClaw extends that model with policy-based privacy and security guardrails for production and enterprise deployments. Each agentic task consumes significantly more tokens than a conventional AI interaction, driving sustained and growing demand. Today, most of that token spend flows to hyperscalers and foundation model companies. Token Factory enables infrastructure operators to serve that demand in their regions, on their own terms and at attractive prices, turning GPU capacity into a token-based revenue stream.
Transforming GPU providers into AI factories
Token Factory changes the competitive equation for neoclouds and sovereign AI clouds. Rather than competing solely on GPU availability and hourly pricing, operators can immediately begin to monetize AI model consumption with a suite of governance, access control and quota management capabilities, all with a consumption model their users already understand. Token Factory becomes the controlled delivery plane for AI models, while frameworks such as OpenClaw and NemoClaw become the token-hungry "applications" that drive consumption.
"Token Factories are the new cellphone companies," said Haseeb Budhani, CEO and co-founder of Rafay Systems. "Similar to how cellphone companies used to sell pre- and post-paid minute plans, AI factories are beginning to sell pre- and post-paid token plans. Team Rafay is looking forward to supporting the success of a thousand AI factories across the world with our Token Factory offering."
How Rafay's Token Factory works
Token Factory extends the Rafay Platform with a purpose-built monetization and metering layer for AI services. It enables AI factory operators to expose AI models via API endpoints. Endpoints are token-metered and provide a number of price, access management and quota definition capabilities, making it easy for both enterprises and retail users to track token consumption and enforce policies in real time across users, applications and agentic workflows.
Token Factory has been validated to work with OpenClaw and NVIDIA NemoClaw, which are driving the highest-velocity token consumption in the market today. Users with OpenClaw or NemoClaw setups point their rigs to API endpoints made available to them through developer-friendly, self-service workflows, and instantly start consuming AI services through a clean, tokenized interface. The complexity of GPU-based hardware, connectivity, access control, scaling, etc. is invisible to end users.
A growing market for token-based AI services
At GTC 2026, NVIDIA CEO Jensen Huang elevated the concept of "tokenomics" to a keynote theme, describing tokens as a new commodity and envisioning a future in which token-based access becomes the standard way enterprises and developers consume AI. The GPU-as-a-Service market is projected to reach $7.36 billion in 2026 and grow to $26.43 billion by 2031, according to Research and Markets. Meanwhile, sovereign AI investment is accelerating globally, with IDC projecting that by 2028, 60% of multinational firms will split their AI stacks across sovereign zones.
As more organizations build or invest in AI factories, the challenge is shifting from provisioning GPU infrastructure to monetizing it. Token Factory addresses that challenge directly, giving operators a ready-made system to offer consumption-based AI services rather than building one in-house.
Already deployed with AI factory operators worldwide
Token Factory builds on Rafay's partnerships with AI factory operators across six continents. The Rafay Platform powers sovereign and neocloud AI deployments for customers including Cassava Technologies, which is deploying Africa's first NVIDIA-powered AI factories; Firmus Technologies, which has integrated Rafay's PaaS capabilities into its green-energy-powered Australian AI Cloud; and Telus, which is building a sovereign AI Studio in Canada. Additional deployments span the Middle East, Latin America and Southeast Asia.
Token Factory is available now as part of the Rafay Platform. For more information, visit rafay.co/platform/ai-token-factory.
About Rafay Systems
Rafay Systems is a leading platform provider for modern infrastructure and AI workloads, delivering Platform-as-a-Service (PaaS) capabilities that enable organizations to operationalize compute infrastructure with self-service automation, governance and multi-tenancy. The Rafay Platform helps enterprises, cloud providers and sovereign AI cloud operators transform raw infrastructure into fully operational platforms for AI, Kubernetes and cloud-native applications. By simplifying infrastructure orchestration and lifecycle management, Rafay enables organizations to accelerate innovation while maintaining security, consistency and operational control. For more information, visit rafay.co.
MEDIA CONTACT
Cristin Connelly
Cathey.co for Rafay System
cristin@cathey.co
View original content to download multimedia:https://www.prnewswire.com/news-releases/rafay-systems-transforms-gpu-providers-into-ai-factories-by-empowering-them-to-monetize-token-metered-access-to-ai-models-302733266.html
SOURCE Rafay Systems
