

Ollama is a local inference framework client that allows one-click deployment of LLMs such as Llama 2, Mistral, Llava, etc. Dify supports integrating LLM and Text Embedding capabilities of large language models deployed with Ollama.

Quick Integration

Download and Launch Ollama

  1. Download Ollama
    Visit https://ollama.ai/download to download the Ollama client for your system.
  2. Run Ollama and Chat with Llava
    ollama run llava
    After successful launch, Ollama starts an API service on local port 11434, which can be accessed at http://localhost:11434.
    For other models, visit Ollama Models for more details.
  3. Integrate Ollama in Dify
    In Settings > Model Providers > Ollama, fill in:
    • Model Name: llava
    • Base URL: http://<your-ollama-endpoint-domain>:11434
      Enter the base URL where the Ollama service is accessible.
      If Dify is deployed using Docker, consider using the local network IP address or the Docker host machine IP address.
      For local source code deployment, use http://localhost:11434.
    • Model Type: Chat
    • Model Context Length: 4096
      The maximum context length of the model. If unsure, use the default value of 4096.
    • Maximum Token Limit: 4096
      The maximum number of tokens returned by the model. If the model has no specific requirements, this can match the model context length.
    • Support for Vision: Yes
      Check this option if the model supports image understanding (multimodal), like llava.
    Click "Save". Once the settings are verified without errors, the model can be used in the application.
    Embedding models are integrated the same way as LLMs; just change the model type to Text Embedding.
  4. Use Ollama Models
    Go to the Prompt Eng. page of the app you want to configure, select the llava model under the Ollama provider, and configure the model parameters before using it.
For more information on Ollama, please refer to: https://github.com/jmorganca/ollama
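
Before filling in the Dify form, it can help to confirm that the Base URL is reachable from the machine (or container) where Dify runs. The following is a minimal sketch rather than an official procedure: the helper name check_ollama and the OLLAMA_BASE_URL variable are hypothetical, and it assumes curl is installed. It queries Ollama's /api/tags endpoint, which lists the locally available models.

```shell
# Hypothetical variable: set it to the Base URL you plan to enter in Dify.
OLLAMA_BASE_URL="${OLLAMA_BASE_URL:-http://localhost:11434}"

# Hypothetical helper: reports whether the Ollama API answers at the given URL.
check_ollama() {
  # -f: treat HTTP errors as failures; -sS: quiet, but still report errors;
  # --max-time: give up after 5 seconds instead of hanging.
  if curl -fsS --max-time 5 "$1/api/tags" >/dev/null 2>&1; then
    echo "reachable: $1"
  else
    echo "unreachable: $1"
    return 1
  fi
}

check_ollama "$OLLAMA_BASE_URL" || true
```

If the check reports the URL as unreachable from inside a Docker container but reachable from the host, the Base URL most likely needs to point at an address other than localhost.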


For more information on Ollama, please refer to: Ollama FAQ

Setting environment variables on Mac

If Ollama is run as a macOS application, environment variables should be set using launchctl:
  1. For each environment variable, call launchctl setenv.
    launchctl setenv OLLAMA_HOST ""
  2. Restart the Ollama application.

Setting environment variables on Linux

If Ollama is run as a systemd service, environment variables should be set using systemctl:
  1. Edit the systemd service by calling systemctl edit ollama.service. This will open an editor.
  2. For each environment variable, add an Environment line under the [Service] section.
  3. Save and exit.
  4. Reload systemd and restart Ollama:
    systemctl daemon-reload
    systemctl restart ollama
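
For step 2, the override opened by systemctl edit might look like the following sketch. It assumes the variable being set is OLLAMA_HOST; the value 0.0.0.0 (listen on all interfaces) is an illustrative assumption and should be replaced with whatever bind address suits your setup.

```ini
[Service]
# Assumed example value. Add one Environment line per variable.
Environment="OLLAMA_HOST=0.0.0.0"
```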

Setting environment variables on Windows

On Windows, Ollama inherits your user and system environment variables.
  1. First, quit Ollama by clicking its icon in the taskbar.
  2. Edit system environment variables from the Control Panel.
  3. Edit or create new variable(s) for your user account, such as OLLAMA_HOST, OLLAMA_MODELS, etc.
  4. Click OK/Apply to save.
  5. Run ollama from a new terminal window.

How can I expose Ollama on my network?

Ollama binds 127.0.0.1 port 11434 by default, so it is reachable only from the local machine. Change the bind address with the OLLAMA_HOST environment variable.
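
When Ollama is started from a shell rather than as a desktop application or service, the variable can be set for that session. This is a minimal sketch, assuming 0.0.0.0 (all interfaces) is the bind address you want; that value is an illustrative assumption, and exposing the port beyond the local machine should be a deliberate choice.

```shell
# Assumed example: listen on all network interfaces instead of the default.
export OLLAMA_HOST="0.0.0.0"
echo "OLLAMA_HOST is set to: $OLLAMA_HOST"

# Start the server from this same shell so that it inherits the variable:
# ollama serve
```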