Together AI is a cloud platform that enables developers and enterprises to run, fine-tune, and deploy open-source large language models (LLMs) at scale with high performance and cost efficiency. The platform provides access to a wide range of open-source models, including Llama, Mistral, and others, through a unified API, along with tools for custom model fine-tuning and inference optimization. Together AI also conducts AI research and has developed its own inference infrastructure designed to deliver fast and affordable generative AI capabilities.
Founded: 2022
Company size: 51-200 employees
Headquarters: San Francisco, USA
Funding: Series B
Drop-in replacement endpoints for OpenAI APIs supporting text, code, and multimodal models.
Deploy models on dedicated infrastructure optimized for speed, control, and economics.
Self-service GPU clusters of up to 64 NVIDIA GPUs, deployable in minutes with no wait times.
Inference engine up to 4x faster, powered by cutting-edge optimizations and the Together Kernel Collection.
Full and lightweight fine-tuning with private data control for 200+ open-source models.
Automatic scaling for GPU clusters to maintain performance without paying for idle resources.
Dedicated Grafana dashboards with GPU, networking, storage, and Kubernetes telemetry.
Free data transfer with managed storage optimized for AI workloads.
Fully managed inference API that auto-scales with request volume.
Access to H100, H200, B200, GB200 GPUs with InfiniBand and NVLink networking.
Turn-key health checks and remediations for production-ready infrastructure.
No subscriptions; pay per token for inference/fine-tuning or hourly for GPU rentals.
Managed orchestration supporting Kubernetes and Slurm for AI workloads.
Hosts 200+ models including Llama, DeepSeek, Qwen, and Mixtral for text, code, and multimodal use.
SDKs and RESTful APIs for developers and enterprise teams.
Common questions about Together AI features, pricing, and capabilities
Which open-source models does Together AI support?
Together AI supports a wide range of state-of-the-art open-source models, including Llama 3, Mixtral, Qwen, and Stable Diffusion. We constantly update our library to include the latest releases, so you can run inference on high-performance models without managing your own infrastructure.
Can I fine-tune models with my own data?
Yes, Together AI provides a dedicated fine-tuning API that lets you customize open-source models with your proprietary data. Our infrastructure is optimized for efficient training, enabling you to create specialized versions of models like Llama for your specific business use cases.
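As a hedged illustration of preparing data for such fine-tuning: supervised fine-tuning datasets are commonly packaged as JSON Lines, one chat example per line. The schema below is a widespread convention, not a confirmed Together AI format, and the toy examples are invented for illustration.

```python
import json

# Hypothetical toy dataset: the chat-message schema here is a common
# convention for SFT data, not a confirmed Together AI format.
examples = [
    {"messages": [
        {"role": "user", "content": "What is our refund window?"},
        {"role": "assistant", "content": "Refunds are accepted within 30 days."},
    ]},
    {"messages": [
        {"role": "user", "content": "Do you ship internationally?"},
        {"role": "assistant", "content": "Yes, to over 40 countries."},
    ]},
]

# Serialize to JSON Lines: one self-contained training example per line.
jsonl = "\n".join(json.dumps(ex) for ex in examples)
print(jsonl.count("\n") + 1)  # number of examples in the file
```

The resulting string would typically be written to a file such as `train.jsonl` and uploaded through the fine-tuning API.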
How do GPU Clusters differ from the Inference API?
While the Inference API provides serverless access to models, GPU Clusters offer dedicated H100 or A100 instances for large-scale training and research. These clusters are interconnected with high-speed networking, making them ideal for organizations building their own foundation models from scratch.
How do I get started?
You can get started in minutes by creating an account, generating an API key, and using our playground to test prompts. Our serverless infrastructure handles deployment automatically, so there is no need to manually provision servers or manage complex Kubernetes clusters.
What help is available for migrating existing workloads?
We provide comprehensive documentation, migration guides, and code samples to help you move your workloads from local hardware to our cloud. Our support team can also help you tune your inference parameters to achieve the best performance-to-cost ratio.
Is the API compatible with OpenAI's?
Together AI offers an OpenAI-compatible API, meaning you can often switch your existing integrations by simply changing the base URL and API key. This lets developers migrate their applications to open-source models with minimal code changes and no architectural overhaul.
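That compatibility can be sketched with only the Python standard library. The base URL, endpoint path, and model name below are assumptions for illustration, and the request is constructed but never sent.

```python
import json
import os
import urllib.request

# Assumed OpenAI-compatible base URL, for illustration only.
BASE_URL = "https://api.together.xyz/v1"

def build_chat_request(model: str, messages: list, api_key: str) -> urllib.request.Request:
    """Construct (but do not send) an OpenAI-style chat completion request."""
    payload = json.dumps({"model": model, "messages": messages}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_chat_request(
    "meta-llama/Llama-3-8b-chat-hf",  # illustrative model name
    [{"role": "user", "content": "Hello"}],
    os.environ.get("TOGETHER_API_KEY", "dummy-key"),  # nothing is sent here
)
```

Because the request shape is the OpenAI one, an existing OpenAI client can usually be repointed by swapping the base URL and key, as the answer above describes.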
Which SDKs and languages are supported?
We provide official SDKs for Python and TypeScript/JavaScript to streamline development. Additionally, because we expose a standard RESTful API, you can integrate Together AI services into any environment that supports HTTP requests, including Go, Ruby, and Java.
How is inference priced?
Our Inference API uses a transparent pay-as-you-go model based on the number of tokens processed (for LLMs) or images generated. This lets you scale your application without upfront costs, paying only for the exact volume of requests your users generate each month.
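A back-of-the-envelope sketch of that per-token billing. The per-million-token prices used here are made-up placeholders, not actual Together AI rates.

```python
def monthly_token_cost(input_tokens: int, output_tokens: int,
                       in_price_per_m: float, out_price_per_m: float) -> float:
    """Cost for one month of usage under per-1M-token billing."""
    return (input_tokens / 1_000_000) * in_price_per_m \
         + (output_tokens / 1_000_000) * out_price_per_m

# Placeholder prices ($ per 1M tokens) -- illustrative only.
cost = monthly_token_cost(input_tokens=2_000_000, output_tokens=1_000_000,
                          in_price_per_m=0.20, out_price_per_m=0.60)
print(f"${cost:.2f}")  # 2 * $0.20 + 1 * $0.60
```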
Are there discounts for high-volume usage?
Yes, we offer custom enterprise pricing and committed-use discounts for customers with high-volume requirements. If your application processes millions of tokens daily or requires dedicated capacity, our sales team can structure a plan that reduces your effective cost per request.
Is my data used to train your models?
We prioritize data privacy and do not use your input data or output generations to train our base models. All data is encrypted in transit using TLS, and we offer enterprise-grade security features to keep your proprietary information confidential during processing.
What compliance certifications does Together AI hold?
Together AI is committed to high security standards and maintains SOC 2 Type II compliance. We provide the documentation and security controls required by enterprise legal and IT departments to ensure our infrastructure meets rigorous data protection requirements.
What support options are available?
All users have access to our extensive documentation, community forums, and email support. Enterprise customers receive enhanced support packages, including dedicated Slack channels, faster response times, and direct access to our engineering team for architectural reviews.
Pay-as-you-go serverless inference for LLMs and Vision models
Starting at $0.00/month
Rates quoted per 1M input tokens, per 1M output tokens, or as a flat rate per 1M tokens for input/output.
Single-tenant GPU instances with guaranteed performance
Starting at $2,872.80/month
Rates quoted hourly for H100 and H200 instances.
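As a sanity check on the dedicated-endpoint figure above: assuming a 720-hour (30-day) billing month, the $2,872.80 monthly starting price corresponds to roughly $3.99 per hour. The hourly rate here is inferred from the monthly figure, not quoted by the source.

```python
# Inferred relationship between the monthly and hourly dedicated-endpoint
# prices, assuming a 720-hour billing month (an assumption, not a quoted term).
HOURS_PER_MONTH = 24 * 30  # = 720

monthly_price = 2872.80                        # "starting at" figure above
hourly_rate = monthly_price / HOURS_PER_MONTH  # inferred: ~$3.99/h
print(round(hourly_rate, 2))
```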
Pay-as-you-go GPU capacity billed at an on-demand hourly rate
Contact for pricing
Reserved GPU capacity for commitments of 6+ months
Contact sales for reservation pricing
Train open-source models (SFT and DPO)
Starting at $0.00/month
Supervised fine-tuning (SFT) with LoRA for models up to 16B parameters
Direct Preference Optimization (DPO) with full fine-tuning for 70-100B parameter models
High-bandwidth parallel filesystem
Starting at $0.00/month
Storage cost per GiB per month