Plans
Explaining how our different subscription tiers work.
Featherless provides serverless access to models as well as agent runtimes, enabling you to power AI applications without needing to manage infrastructure.
Our plans are subscription- and concurrency-based: each plan allows unlimited monthly requests with a fixed number of concurrent requests. Subscription tiers differ by the model sizes and context lengths offered.
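Because a plan caps concurrent requests rather than total requests, a client typically wants to keep the number of in-flight calls at or below its limit. The following is a minimal sketch of that idea using a semaphore. The names `run_with_limit`, `fake_request`, and `demo` are hypothetical, the limit of 2 is an arbitrary illustration rather than a value from any Featherless plan, and `fake_request` sleeps instead of calling a real API.

```python
import asyncio


async def run_with_limit(jobs, max_concurrent):
    """Run coroutine factories with at most `max_concurrent` in flight.

    Returns the results in submission order plus the peak number of
    simultaneous jobs observed, so the cap can be checked.
    """
    semaphore = asyncio.Semaphore(max_concurrent)
    in_flight = 0
    peak = 0

    async def guarded(job):
        nonlocal in_flight, peak
        # Wait here until one of the `max_concurrent` slots is free.
        async with semaphore:
            in_flight += 1
            peak = max(peak, in_flight)
            try:
                return await job()
            finally:
                in_flight -= 1

    results = await asyncio.gather(*(guarded(job) for job in jobs))
    return results, peak


async def fake_request(i):
    # Hypothetical stand-in for an inference call; no network is used.
    await asyncio.sleep(0.01)
    return f"response {i}"


def demo(num_requests=10, max_concurrent=2):
    # Bind `i` per job so each factory creates its own request.
    jobs = [lambda i=i: fake_request(i) for i in range(num_requests)]
    return asyncio.run(run_with_limit(jobs, max_concurrent))


if __name__ == "__main__":
    results, peak = demo()
    print(len(results), "requests completed; peak concurrency:", peak)
```

In this sketch the total number of requests is unbounded and only the number running at once is limited, which mirrors the plan model described above. In real use, `fake_request` would be swapped for an actual API call and `max_concurrent` set to whatever the subscription allows.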
Featherless offers consumer plans in two groups: smaller plans for interactive chat, such as assistants and role-playing, and two larger plans for agentic inference and coding.
Consumer Plans
Plan | Tier | Price (/month) | Features |
Chat | | $10 | |
Chat | | $25 | |
Agentic | | $100 | |
Agentic | | $200 | |
*Smaller models allow for higher concurrency than larger models. See more below.
Business Plans
Business plans are scalable: users can purchase larger amounts of inference to power production applications, whether agent fleets or other AI applications.
Plan | Price (/unit/month) | Features |
| $100 | |
| $200 | |
*For more information on how the concurrency limits work, visit:
Concurrency Limits
Explaining how subscription tiers translate to concurrent inference call maximums.