Claude API in Python Cheat Sheet

Build with the Claude API in Python. This cheat sheet covers the Anthropic SDK essentials — messages, streaming, vision, tool use, embeddings, and token counting — with ready-to-run code.

2026年5月22日 · 5 分読む

Have this cheat sheet at your fingertips

Download PDF

Claude is Anthropic's family of large language models for reasoning, analysis, coding, and multimodal understanding. This cheat sheet covers the Anthropic Python SDK essentials — sending messages, streaming, vision, tool use, embeddings, and token counting — with ready-to-run code.

Getting started

Install the Python SDK

pip install anthropic

Set your API key

Set it once in your shell:

export ANTHROPIC_API_KEY="your_api_key_here"

Or from Python:

import os
os.environ["ANTHROPIC_API_KEY"] = "your_api_key_here"

Create a Claude client

from anthropic import Anthropic

# Initialize the Claude client using your API key
claude_client = Anthropic()

Overview

Core SDK methods

Send messages with claude_client.messages.create()
Stream responses with claude_client.messages.stream()
Generate embeddings with claude_client.embeddings.create()
Estimate tokens with claude_client.messages.count_tokens()
List models with claude_client.models.list()

Key jargon

Model — a specific Claude checkpoint.
Messages API — the primary conversational endpoint.
System message — used to define assistant behavior and constraints.
Conversation — full message history sent every request.
Streaming — receive output incrementally.
Tool use — allow Claude to call external tools.
Embeddings — vector representations of text.
Temperature — controls the randomness of model output.
Token counting — estimate cost before sending.
Context window — maximum tokens allowed in a request (input + output).

Workflows

1. Basic message

Use for one-off prompts without conversation state.

# Send a user message with generation controls
response = claude_client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=300,  # Always set token limit
    messages=[
        {"role": "user",
         "content": "What is the best LLM?"}
    ],
)

# Claude returns structured content blocks
print(response.content[0].text)

2. Count tokens before sending

Estimate cost or prevent context window overflow.

# Estimate how many tokens the input will use
token_count = claude_client.messages.count_tokens(
    model="claude-opus-4-6",
    messages=[{
        "role": "user",
        "content": "Explain machine learning like I'm five."}],
)

# Access estimated input token count
print(token_count.input_tokens)

3. Multi-turn conversation

Maintain conversation state client-side and resend the full history each request.

# Define conversation including a system instruction
conversation_history = [
    {"role": "system",
     "content": "You are a concise technical assistant."},
    {"role": "user",
     "content": "Explain transformers simply."},
    {"role": "assistant",
     "content": "Transformers model sequences efficiently."},
    {"role": "user",
     "content": "How are they different from RNNs?"}
]

# Send conversation history to Claude
response = claude_client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=300,
    messages=conversation_history,  # Send full history
)

# Print assistant reply
print(response.content[0].text)

4. Streaming responses

Use for long outputs or real-time interfaces.

# Stream response to reduce perceived latency
with claude_client.messages.stream(
    model="claude-sonnet-4-6",
    max_tokens=300,
    messages=[{
        "role": "user",
        "content": "Write a poem about AI."}],
) as stream:
    # Print text as it is generated
    for chunk in stream.text_stream:
        print(chunk, end="", flush=True)

5. Vision and document Q&A

Send images or PDFs alongside text questions.

import base64

# Encode an image file
with open("invoice.png", "rb") as img_file:
    encoded_img = base64.b64encode(img_file.read()).decode("utf-8")

# Send image and question to Claude
vision_response = claude_client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=300,
    messages=[{
        "role": "user",
        "content": [
            {
                "type": "image",
                "source": {
                    "type": "base64",
                    "media_type": "image/png",
                    "data": encoded_img
                }
            },
            {"type": "text",
             "text": "What is the invoice total?"}
        ],
    }],
)

# Print assistant reply
print(vision_response.content[0].text)

6. Tool use

Allow Claude to request functionality from other software.

# Define a tool schema
weather_tool = {
    "name": "get_weather",
    "description": "Get current weather for a city",
    "input_schema": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"]
    }
}

# Send request with tool definitions
tool_response = claude_client.messages.create(
    model="claude-opus-4-6",
    max_tokens=300,
    tools=[weather_tool],  # Provide tool schema
    messages=[{
        "role": "user",
        "content": "What's the weather in London?"}],
)

# Print assistant reply (may include tool call request)
print(tool_response.content)

7. Embeddings for search and RAG

Generate vectors for retrieval systems. Anthropic recommends using Voyage AI embeddings.

# Install the official Voyage AI package
pip install -U voyageai

# Use the API key from your environment
from voyageai import Client

# Initialize a Voyage AI client (reads VOYAGE_API_KEY)
voyage = Client()

# Generate an embedding vector for semantic search
embedding_response = voyage.embed(
    ["Semantic search example text"],
    model="voyage-4"
)

# Access the embedding vector
embedding_vector = embedding_response.embeddings[0]
print(len(embedding_vector))  # dimensionality (e.g., 1024)

8. Model discovery

List and pin models for stable production systems.

# Retrieve available models
available_models = claude_client.models.list()

# Print model IDs
for model in available_models.data:
    print(model.id)

トピック

Artificial Intelligence

Python

Continue your Claude API journey

Tracks

開発者向けアソシエイトAIエンジニア

26時間

APIやオープンソースライブラリを使って、ソフトウェアアプリケーションにAIを統合する方法を学びます。 AIエンジニアになるための旅を今日始めましょう！

詳細を見る

コースを開始

Courses

Claude モデル入門

3時間

11.4K

Anthropic API を用い、AI アプリの構築やビジネスの課題解決に Claude を活用する方法を学びます。

詳細を見る

コースを開始

Courses

Software Development with Claude Code

4時間

4.6K

Claude Code brings AI assistance to your terminal. Learn the workflows that turn it into a reliable tool for real software development.

詳細を見る

コースを開始

The OpenAI API in Python

ChatGPT and large language models have taken the world by storm. In this cheat sheet, learn the basics on how to leverage one of the most powerful AI APIs out there, then OpenAI API.

Richie Cotton

tutorials

Getting Started with the Claude 2 and the Claude 2 API

The Python SDK provides convenient access to Anthropic's powerful conversational AI assistant Claude 2, enabling developers to easily integrate its advanced natural language capabilities into a wide range of applications.

Abid Ali Awan

tutorials

Claude Sonnet 3.5 API Tutorial: Getting Started With Anthropic's API

To connect through the Claude 3.5 Sonnet API, obtain your API key from Anthropic, install the anthropic Python library, and use it to send requests and receive responses from Claude 3.5 Sonnet.

Ryan Ong

tutorials

Claude Code Tutorial: Setup, Refactoring, and Debugging in Practice

Learn how to use Anthropic's Claude Code to improve software development workflows through a practical example using the Supabase Python library.

Aashi Dutt

tutorials

Getting Started with Claude 3 and the Claude 3 API

Learn about the Claude 3 models, detailed performance benchmarks, and how to access them. Additionally, discover the new Claude 3 Python API for generating text, accessing vision capabilities, and streaming.

Abid Ali Awan

tutorials

Claude Sonnet 4: A Hands-On Guide for Developers

Explore Claude Sonnet 4’s developer features—code execution, files API, and tool use—by building a Python-based math-solving agent.

Bex Tuychiev

もっと見るもっと見る

Getting started

Install the Python SDK

Set your API key

Create a Claude client

Overview

Core SDK methods

Key jargon

Workflows

1. Basic message

2. Count tokens before sending

3. Multi-turn conversation

4. Streaming responses

5. Vision and document Q&A

6. Tool use

7. Embeddings for search and RAG

8. Model discovery

The OpenAI API in Python

Getting Started with the Claude 2 and the Claude 2 API

Claude Sonnet 3.5 API Tutorial: Getting Started With Anthropic's API

Claude Code Tutorial: Setup, Refactoring, and Debugging in Practice

Getting Started with Claude 3 and the Claude 3 API

Claude Sonnet 4: A Hands-On Guide for Developers

.css-1531qan{-webkit-text-decoration:none;text-decoration:none;color:inherit;}開発者向けアソシエイトAIエンジニア

Claude モデル入門

Software Development with Claude Code

The OpenAI API in Python

Getting Started with the Claude 2 and the Claude 2 API

Claude Sonnet 3.5 API Tutorial: Getting Started With Anthropic's API

Claude Code Tutorial: Setup, Refactoring, and Debugging in Practice

Getting Started with Claude 3 and the Claude 3 API

Claude Sonnet 4: A Hands-On Guide for Developers

開発者向けアソシエイトAIエンジニア