Baseten’s cover photo
Baseten

Baseten

Software Development

San Francisco, CA 24,647 followers

Own your inference.

About us

Inference is everything. Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring great AI products to market - fast. Our proprietary Inference Stack utilizes the cutting-edge of performance research combined with highly performant and reliable infrastructure to give you out-of-the-box global availability with 99.99% of uptime.

Website
https://www.baseten.co/
Industry
Software Development
Company size
51-200 employees
Headquarters
San Francisco, CA
Type
Privately Held
Specialties
developer tools, software engineering, artificial intelligence, and machine learning

Products

Locations

Employees at Baseten

Updates

  • Baseten reposted this

    View profile for Amir Haghighat

    Baseten14K followers

    Almost half of US doctors use OpenEvidence everyday to converse with a world of medical knowledge and latest medical research breakthroughs. A few minutes of downtime can result in thousands of suboptimal clinical decisions. We’re proud they’ve chosen us to power their inference.

  • Baseten reposted this

    View profile for Dannie Herzberg

    Baseten10K followers

    Nearly half of all U.S. physicians rely on OpenEvidence every single day - at the bedside, in the operating room, and at the point of care. It surfaces research in real time, exactly when doctors need it most. Baseten powers the inference that makes this possible - a responsibility we take very seriously. Thank you, Team OpenEvidence, for what you do and for the trust.

  • Baseten reposted this

    OpenEvidence answers over 1 Million questions every day from more than half of practicing physicians in the US. Physicians-and their patients-need OpenEvidence to provide the most accurate, up to date information in those critical moments. Downtime or delays have real life consequences; we partner with Baseten to provide the inference infrastructure make sure our answers are always available when physicians are making those high stakes clinical decisions. Baseten came through our office recently to talk to us about why this is so important, watch below:

  • Gemma 4 is live on Baseten and available to all customers on day 0 via the Baseten model library. All models in the Gemma 4 family are multimodal, supporting text and image inputs with text output. Key capabilities include: -> Advanced reasoning and thinking -> Coding and function calling -> OCR for document understanding -> Long context windows up to 256K tokens But the most impressive is how Gemma 4 is pushing the boundaries of model architecture with innovations including alternative attention mechanisms, Proportional RoPE, Per-Layer Embeddings (PLE), KV-Cache Sharing, native aspect ratio handling for vision, and a smaller frame window for audio. All are designed to improve efficiency, accuracy, and scalability. Try it today: https://lnkd.in/gEVxUuxh

    • No alternative text description for this image
  • Baseten reposted this

    View organization page for NVIDIA

    5,011,692 followers

    Delivered performance, not peak chip specifications, drives AI factory productivity. Rigorous benchmarks are the only way to see past the noise. In MLPerf Inference v6.0, NVIDIA extreme co-design delivered the highest token output across the broadest range of models and scenarios. Maximizing token output drives down token cost and maximizes AI factory productivity. Read the blog post to dive into the details: https://nvda.ws/3OaEE5b Baseten, CoreWeave, MLCommons

  • What if LLMs could remember as humans do? LLM memory is either perfect and lossless or ultra-compressed. What does a slightly compressed working memory to extend its context window look like? Our researchers built a 7M-parameter perceiver that compresses KV caches 8x while retaining 90%+ factual retention. Unlike existing compaction methods, we trained a model to do this in a single forward pass. We see this as the first step toward models that actually learn from experience. Read here: https://lnkd.in/eqaVCPSv

    • No alternative text description for this image
  • We've had a great month of March! A brief recap: -> NVIDIA GTC, featuring book signing, ice cream, and swag -> KubeCon EMEA, including a 2000+ person House of Kube event -> AI engineering leaders dinners at Wolfsbane (SF) and Manhatta (NYC) -> Baseten-branded ice cream social in SF -> AI/ML trivia night in the West Village -> NYC office warming party Want to attend our next event? Sign up here https://lnkd.in/gNZuscJ4

  • Baseten reposted this

    View organization page for Rootly

    11,467 followers

    500+ engineers proved it at KubeCon: sometimes all people need is a chance to unwind after a long conference day. As snacks, wine, and soft drinks washed over the RAI's Boat House, SREs, DevOps, and Platform Engineers shared how they're keeping up with the faster-than-ever cycles in the industry. Thanks to Rootly, Upwind Security, Baseten, Checkly, Cloudsmith, MetalBear, FusionAuth, Echo, Twingate, and Spotify for Backstage for making the evening unforgettable.

    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image

Similar pages

Browse jobs

Funding

Baseten 6 total rounds

Last Round

Series D

US$ 150.0M

See more info on crunchbase