Kagi Assistant

Kagi Assistant combines the top large language models (LLMs) with optional results from Kagi Search, making it a strong companion for creative, research, and programming tasks, along with everything else you can think of! All of this is included in a single subscription!

Features

Access to the latest and most performant large language models from OpenAI, Anthropic, Meta, Google, Mistral, Amazon, Alibaba, and DeepSeek
Multiple custom assistants
The ability to control whether the Assistant has web access (powered by Kagi Search)
Applying Kagi Search Lenses and Personalized Results to the Assistant searches
Saving Assistant threads
Uploading files to use as context
Altering the Assistant configuration within the thread
- For example, you can ask the initial question with web access enabled and then disable it for subsequent questions!
- It is also possible to switch to a different LLM in the middle of a thread
Code syntax highlighting
Keyboard Shortcuts
Export conversations to markdown
Share threads with others using a link
Voice input

Privacy

When you use the Assistant by Kagi, your data is never used to train AI models (neither by us nor by the LLM providers), and no account information is shared with the LLM providers. By default, threads are deleted after 24 hours of inactivity. This behavior can be adjusted in the settings.

Using the Assistant

Kagi Assistant can be accessed via the apps menu located in the top right corner of all Kagi pages or by using bangs in search. You can also use this direct link.

When you first access the Assistant, you will be greeted by a familiar-looking landing page, allowing you to get right into using it. You can either type your prompt or use voice input by pressing the microphone symbol. You can choose which LLM you wish to use by opening the dropdown menu just below the prompt field.

The Assistant's web access can be toggled via the button below the prompt field.

Which model to choose

There is no definitive answer to the question of which LLM is best. As the number of competing models increases, it can be difficult to find the right one for a given task. To help with this, Kagi maintains a list of recommended models at the top of the LLM list.

Screenshot showing the recommended models in Assistant model selection menu

Kagi recommended models as of July 27, 2025.

The recommendations are based on the Kagi LLM Benchmarking Project. The benchmark tests measure model quality in various scenarios.

Another important aspect is the privacy policy of the model provider. See our LLM Privacy Comparison for a detailed overview of how each provider handles your data.

Threads

The Assistant supports threads, allowing you to keep your bagel topping ideas separate from your weekend projects.

The search bar enables you to search for that one elusive thread.

By default, threads are kept for 24 hours after the last message. If keeping threads alive permanently better fits your workflow, you can adjust this setting in Assistant Settings. Please note that the thread saving setting is applied when the thread is created.

Threads can be renamed, downloaded, shared, and deleted via the ⋮ button which is displayed when you hover over the thread. Threads can be further organized by adding tags to them.

Uploading Files to Assistant

Kagi Assistant supports file uploads, allowing you to provide additional context or information for your queries.

This can be useful for tasks like:

Summarizing a document
Extracting key insights from a report
Analyzing data in a spreadsheet
Describing an image
Distilling main points from an audio file

To upload a file:

Click the paperclip icon in the prompt input box.
Select the file or image you wish to upload.
Provide a prompt with instructions to process the file or leave it blank to summarize it.

Important considerations for file uploads:

File size limit: The maximum total size for uploads is 16 MB, whether you attach one file or several.
Processing time: Larger files may take a few moments to process.
Context retention: Uploaded file content remains in the conversation context for subsequent messages.

The Assistant supports various file formats across different categories, including:

File Type	Supported Formats
Text	txt, text, md (and other text-based formats)
Rich Format	pdf, docx, pptx
Spreadsheets	csv, tsv, xlsx, json, jsonl
Image	jpg, jpeg, png, gif, tiff, tif, webp
Audio	3gpp, aa, aac, aax, act, aiff, amr, ape, au, awb, dct, dss, dvf, flac, gsm, iklax, ivs, m4a, m4b, m4p, mp4, mmf, mp3, mpc, msv, ogg, opus, ra, rm, sln, tta, vox, wav, wma, wvpla

Note: Unsupported formats may be treated as binary files.
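The limits and formats above can be checked locally before uploading. The following is a minimal Python sketch, not part of Kagi's tooling: the function names `file_category` and `check_upload` are hypothetical, the audio set is only a subset of the table, and treating 1 MB as 1024 × 1024 bytes is an assumption, since the page does not define the unit.

```python
from pathlib import Path
from typing import Dict, List, Optional

# Total upload limit from the documentation (16 MB).
# Assumption: 1 MB = 1024 * 1024 bytes; the page does not say which unit is meant.
MAX_UPLOAD_BYTES = 16 * 1024 * 1024

# Extensions copied from the supported-formats table.
# The Audio entry is a subset of the table, kept short for readability.
SUPPORTED_FORMATS = {
    "Text": {"txt", "text", "md"},
    "Rich Format": {"pdf", "docx", "pptx"},
    "Spreadsheets": {"csv", "tsv", "xlsx", "json", "jsonl"},
    "Image": {"jpg", "jpeg", "png", "gif", "tiff", "tif", "webp"},
    "Audio": {"aac", "flac", "m4a", "mp3", "ogg", "opus", "wav"},
}


def file_category(filename: str) -> Optional[str]:
    """Return the table category for a filename, or None if the extension is not listed."""
    extension = Path(filename).suffix.lower().lstrip(".")
    for category, extensions in SUPPORTED_FORMATS.items():
        if extension in extensions:
            return category
    return None


def check_upload(files: Dict[str, int]) -> List[str]:
    """Take a mapping of filename -> size in bytes and return a list of problems found."""
    problems = []
    total = sum(files.values())
    if total > MAX_UPLOAD_BYTES:
        problems.append(f"total size {total} bytes exceeds the 16 MB limit")
    for name in files:
        if file_category(name) is None:
            # Other text-based formats may still work; unlisted formats
            # may be treated as binary files.
            problems.append(f"{name}: extension not in the supported list")
    return problems
```

For example, `check_upload({"notes.md": 1024})` returns an empty list, while two PDFs of 10 MB and 7 MB together trigger the total-size warning.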

Fetching online content

Assistant can fetch webpages and online documents (up to 50 MB) to use them as context for your conversation. To use this feature, simply paste the URL in your Assistant conversation (make sure the Entire Web toggle is on).

Custom Instructions

Do you prefer a more personalized Assistant experience? You can provide custom instructions in the Assistant Settings. These instructions can be utilized to refine the Assistant's responses. You can, for instance, instruct the Assistant to be more succinct or to consider your profession and location.

Custom Assistants

You can create Custom Assistants in the Assistant Settings. It is possible to customize the LLM, settings (the use of web access, lenses, and personalized results), and the instructions for each Custom Assistant.

Assistant comes with a built-in Code Custom Assistant that is optimized for programming tasks. It uses Claude 4 Sonnet and has web access.

For more details, refer to the Custom Assistants page.

Keyboard Shortcuts

The following keyboard shortcuts are available in Assistant on Mac and PC.

Mac Shortcut	Action
⌘ + K	New Thread
⌘ + Shift + S	Toggle Sidebar
⌘ + Shift + C	Copy Last Response
⌘ + Shift + E	Edit Last Message
⌘ + Shift + Backspace	Delete Current Thread
⌘ + /	Focus Prompt Box
⌘ + .	Show Keyboard Shortcuts

PC Shortcut	Action
Ctrl + K	New Thread
Ctrl + Shift + S	Toggle Sidebar
Ctrl + Shift + C	Copy Last Response
Ctrl + Shift + E	Edit Last Message
Ctrl + Shift + Backspace	Delete Current Thread
Ctrl + /	Focus Prompt Box
Ctrl + .	Show Keyboard Shortcuts

Available LLMs

Developer	Model	Plan
Alibaba	Qwen 3 235B	All
Alibaba	Qwen 3 235B (reasoning)	All
Alibaba	Qwen 3 Coder	All
Anthropic	Claude 4.5 Haiku	Ultimate
Anthropic	Claude 4.5 Sonnet	Ultimate
Anthropic	Claude 4.1 Opus	Ultimate
Anthropic	Claude 4.5 Sonnet (Reasoning)	Ultimate
Anthropic	Claude 4.1 Opus (Reasoning)	Ultimate
Anthropic	Claude 4.5 Haiku (Reasoning)	Ultimate
DeepSeek	DeepSeek Chat V3.1 Terminus	All
DeepSeek	DeepSeek R1	Ultimate
Google	Gemini 2.5 Flash	All
Google	Gemini 2.5 Flash Lite	All
Google	Gemini 2.5 Pro	Ultimate
Meta	Llama 4 Maverick	All
Mistral AI	Mistral Small	All
Mistral AI	Mistral Medium	All
Mistral AI	Mistral Large	Ultimate
Moonshot AI	Kimi K2	All
Nous Research	Hermes-4-405B	All
Nous Research	Hermes-4-405B (reasoning)	All
OpenAI	GPT 5 Mini	All
OpenAI	GPT 5 Nano	All
OpenAI	GPT OSS 120B	All
OpenAI	GPT OSS 20B	All
OpenAI	GPT 4.1 mini	All
OpenAI	GPT 4.1 nano	All
OpenAI	GPT 4.1	Ultimate
OpenAI	GPT 5	Ultimate
OpenAI	GPT 5 Codex	Ultimate
OpenAI	o4 mini	Ultimate
OpenAI	o3	Ultimate
OpenAI	o3 pro	Ultimate
OpenAI	ChatGPT	Ultimate
xAI	Grok Code Fast 1	All
xAI	Grok 4 Fast	All
xAI	Grok 4 Fast (Reasoning)	All
xAI	Grok 4	Ultimate
Z.ai	GLM-4.6 (preview)	All
Z.ai	GLM-4.6 (reasoning) (preview)	All

You can learn more about how these models compare in the Kagi LLM Benchmarking Project page.

For more information about each model and its privacy practices, including details about providers, see our LLM Privacy page.

Bangs

You can quickly access Assistant using the following bangs:

!ai, !as, !assistant, !research, !answer, !discuss, !expert, !llm, !custom, and !asst: These bangs direct you to the general Assistant interface for various types of queries.
!chat: This bang accesses Assistant with internet access turned off.
!code: Use this bang to access the built-in Code Custom Assistant, which is tailored for coding-related queries.
!ki: This bang accesses Assistant with the Ki profile, providing a specialized interaction.

Each bang is designed to optimize your search experience by directing you to the most appropriate version of Assistant for your needs.

URL Parameters

You can specify a particular model in the Assistant's URL by including a profile parameter, for example https://kagi.com/assistant?profile=gpt-5. The available model names can be found in the table above.

This can also be used with custom assistants, as described in the Custom Assistants documentation.

The internet parameter turns internet access on or off. Set it to true to enable access; any other value disables it. This overrides the internet setting of the profile used.

The lens parameter sets the lens when internet access is enabled. Its value is the lens name in lowercase; for example, https://kagi.com/assistant?lens=programming will use the Programming lens.

The q parameter can be used to submit a prompt immediately after the page loads. The qvalue parameter can be used to prefill the prompt box without submitting it.

Here is an example of a URL that enables internet access, uses the Claude 4 Sonnet model, applies the Recipes lens, and submits a prompt immediately. You might use it as a target for a custom bang:

https://kagi.com/assistant?profile=claude-4-sonnet&internet=true&lens=recipes&q=%s
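The parameters described in this section can be assembled programmatically. Below is a small Python sketch under stated assumptions: the helper names `assistant_url` and `custom_bang_template` are hypothetical, the profile value must be one that the Assistant actually accepts (the exact slug for each model is not spelled out here), and the lens name is simply lowercased, which may not be enough for multi-word lens names.

```python
from typing import Optional
from urllib.parse import quote, urlencode

ASSISTANT_URL = "https://kagi.com/assistant"


def assistant_url(
    profile: Optional[str] = None,
    internet: Optional[bool] = None,
    lens: Optional[str] = None,
    q: Optional[str] = None,
    qvalue: Optional[str] = None,
) -> str:
    """Build an Assistant URL from the documented query parameters."""
    params = {}
    if profile is not None:
        params["profile"] = profile
    if internet is not None:
        # "true" enables internet access; any other value disables it.
        params["internet"] = "true" if internet else "false"
    if lens is not None:
        params["lens"] = lens.lower()
    if q is not None:
        params["q"] = q  # submitted immediately after the page loads
    if qvalue is not None:
        params["qvalue"] = qvalue  # prefills the prompt box without submitting
    if not params:
        return ASSISTANT_URL
    # quote_via=quote encodes spaces as %20 rather than "+".
    return f"{ASSISTANT_URL}?{urlencode(params, quote_via=quote)}"


def custom_bang_template(**settings) -> str:
    """Build a custom bang target; the %s placeholder is appended unencoded."""
    base = assistant_url(**settings)
    separator = "&" if "?" in base else "?"
    return f"{base}{separator}q=%s"


print(custom_bang_template(profile="claude-4-sonnet", internet=True, lens="Recipes"))
```

The `%s` placeholder is appended after encoding because passing it through `urlencode` would turn it into `%25s`.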

Availability

Assistant is available to all members. However, premium models are only available in our Ultimate plan. If you are on a different plan and you need access to these models, you can upgrade from the Billing Settings page.

We also offer an Ultimate upgrade for Family Plans. You can upgrade from the Family Management page.

Usage Limits

Context window limit

There's no fixed limit on conversation length. We automatically optimize lengthy chats behind the scenes to maintain performance.

Input limitations

Text input

Maximum 100,000 characters per message
Text exceeding this limit will be automatically truncated

File uploads

Maximum total size: 16 MB (applies to single or multiple files)
URL content: 50 MB maximum retrievable size

Custom Instructions

Maximum 20,000 characters for custom Assistant instructions
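If you generate prompts or custom instructions with a script, you can check them against these character limits before pasting them in. This is a rough Python sketch with hypothetical helper names. It assumes that characters are counted the way Python's `len` counts them and that truncation keeps the beginning of the text, neither of which is stated on this page.

```python
MAX_MESSAGE_CHARS = 100_000            # per-message text limit
MAX_CUSTOM_INSTRUCTION_CHARS = 20_000  # custom Assistant instructions limit


def fit_message(text: str) -> tuple:
    """Return (text that fits, number of characters that would be cut off).

    Assumption: truncation keeps the start of the message.
    """
    overflow = max(0, len(text) - MAX_MESSAGE_CHARS)
    return text[:MAX_MESSAGE_CHARS], overflow


def instructions_fit(instructions: str) -> bool:
    """Check custom instructions against the 20,000-character limit."""
    return len(instructions) <= MAX_CUSTOM_INSTRUCTION_CHARS
```

A non-zero overflow is a hint to split the text across messages or upload it as a file instead.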

Fair Use Policy

We use a value-based usage system to maintain high-quality service for all users:

Your monthly plan determines your token usage allowance.
- For example, a $25 monthly plan provides up to $25 worth of token usage across all models.
For yearly plans, you get access to the full year's worth of token usage at the start of the plan.
- For instance, the Ultimate yearly plan allows up to $270 worth of token usage for the entire year.
A 20% margin is included in token usage cost calculations to cover search queries, infrastructure, and development costs.
- For example, $25 token usage consists of $20 for raw token costs and $5 for operational costs.
Users will receive an in-app reminder as they near their usage limit. If the limit is exceeded, new AI interactions will be disabled until they either renew their plan early or the next billing cycle begins.
- Note: We will soon introduce the option to purchase top-up credits, allowing you to extend Assistant usage beyond fair-use limits with an amount of your choice. These credits can then also be used for other Kagi products such as the API.
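The arithmetic behind these examples can be sketched in a few lines of Python. This is an illustration, not Kagi's billing logic: it assumes the 20% figure is a share of the total allowance, which is the reading that matches the $25 = $20 + $5 example, and the function names are hypothetical.

```python
from decimal import Decimal

# Assumption: 20% of the allowance covers operational costs,
# consistent with the $25 -> $20 raw + $5 operational example.
MARGIN_SHARE = Decimal("0.20")


def split_allowance(allowance_usd):
    """Split a plan's allowance into (raw token budget, operational share)."""
    allowance = Decimal(str(allowance_usd))
    operational = allowance * MARGIN_SHARE
    return allowance - operational, operational


def counted_usage(raw_token_cost_usd):
    """Usage counted against the allowance for a given raw token cost."""
    return Decimal(str(raw_token_cost_usd)) / (1 - MARGIN_SHARE)


for plan, allowance in [("$25 monthly", 25), ("Ultimate yearly", 270)]:
    raw, operational = split_allowance(allowance)
    print(f"{plan}: ${raw} raw token cost + ${operational} operational")
```

Under this reading, the $270 yearly allowance corresponds to $216 of raw token cost, and $20 of raw token cost counts as $25 of usage.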

For additional questions about these limitations or policies, please contact our support team.

Tips to reduce token usage

Here are some suggestions to reduce token usage:

Use less expensive models for simple tasks like summarization or basic information extraction. Our LLM Benchmarking project page contains cost information for the different models.
Create new threads for unrelated questions rather than continuing in the same conversation.
Be specific and concise in your prompts to get more focused responses.
Use the "Edit Prompt" feature (pencil icon) to refine your question instead of sending multiple clarifications.
Disable web access when you don't need internet information.
Limit file uploads to only what's necessary for your query.
Break complex tasks into smaller, focused questions across multiple threads.
Use custom instructions to request consistently concise responses.
Leverage specialized custom assistants optimized for specific tasks.
Download and delete completed threads to avoid accidentally continuing old conversations.

FAQ

Q: What is Kagi’s stance on using LLMs in search?
A: We continue to relentlessly focus on the core search experience and build thoughtfully integrated features on top of it. Read more about it in our AI Integration Philosophy page.
