Voiceflow named a 2026 Best Software Award winner by G2
Read now

As the demand for high-performing language models continues to grow, more developers and organizations are seeking ways to run large language models (LLMs) locally. Whether it’s for privacy, customization, or avoiding the high costs of cloud-based APIs, local deployment puts full control in your hands.
DeepSeek R1—a powerful open-source LLM optimized for reasoning and problem-solving—offers an excellent balance between performance and accessibility. And with Ollama, setting it up locally is remarkably straightforward. In this guide, you’ll learn exactly how to install Ollama, run DeepSeek R1 on your own machine, and begin leveraging its capabilities—without relying on external services.
Deploying DeepSeek R1 on your local system offers a range of practical benefits:
Keep sensitive data on your own infrastructure, minimizing the risk of exposure, logging, or data retention associated with cloud APIs.
Avoid latency caused by API requests over the internet. With local inference, responses are faster and more consistent.
Say goodbye to pay-per-token charges or subscription fees. Once downloaded, the model runs freely on your hardware.
Running locally means you can experiment, fine-tune prompts, or integrate the model into bespoke workflows without platform limitations.
Once installed, DeepSeek R1 can run entirely offline—ideal for secure environments or when working without internet access.
Ollama is a lightweight tool that makes running LLMs locally simple and efficient. It handles downloading, installing, and running models behind the scenes, letting you focus on using the model—not configuring it.
Ollama supports a variety of models, including DeepSeek R1 and its smaller distilled versions. Whether you're working on macOS, Linux, or Windows, the setup process is smooth and consistent.
Before we dive into the installation steps, let’s clarify what the terminal (or “command line”) is, why it’s used, and how to access it on each operating system.
With that in mind, let’s proceed to the actual setup steps.
Open your terminal and run:
curl -fsSL https://ollama.com/install.sh | shAfter installation, confirm that Ollama is working by checking the version:
ollama --versionWith Ollama installed, you can download and launch DeepSeek R1 using a single command:
ollama run deepseek-r1This command will:
If your system can’t support the full-scale model (671B parameters), you can run a smaller version. Ollama supports various sizes such as 1.5B, 7B, 14B, and more.
Use this format:
ollama run deepseek-r1:XbReplace X with the desired model size. For example:
ollama run deepseek-r1:7bThis flexibility ensures you can run DeepSeek R1 even on modest hardware.
To make the model available for use in applications or scripts, you can run it as a background API server: ollama serve
Once you've started the Ollama server using:
ollama serveOllama exposes a local API (usually at http://localhost:11434) that allows you to interact with the model programmatically. You can use tools like curl, Postman, or Python to send requests.
Here's a simple example that sends a prompt to DeepSeek R1 and receives a response:
curl http://localhost:11434/api/chat -d '{
"model": "deepseek-r1",
"messages": [
{ "role": "user", "content": "What is the capital of France?" }
],
"stream": false
}'The server will return a JSON response containing the model's reply.
Now that DeepSeek R1 is up and running locally on your machine, you can explore a range of possibilities and tools to get the most out of your setup. Here are some ideas on where to go next:
Running DeepSeek R1 locally is just the beginning. With a bit of experimentation and the right tooling, you can transform your setup into a powerful, custom AI platform—free from cloud constraints and tailored to your unique needs.
If you’re looking to build a fully interactive voice or chat experience, Voiceflow provides a no-code conversation design platform that can connect to your locally running LLM:
By combining Voiceflow’s conversation-building tools with your locally running DeepSeek R1 instance, you can quickly prototype and deploy custom AI-driven assistants—without relying on external cloud APIs. This approach offers the best of both worlds: a user-friendly, no-code conversation flow builder on the front end, and a fully private, customizable large language model on the back end.
Running DeepSeek R1 locally with Ollama is a fast and reliable way to access cutting-edge AI capabilities without sacrificing privacy, speed, or flexibility. Whether you're building a prototype, developing an internal tool, or just experimenting with open-source AI, this setup gives you full autonomy—no cloud services required.
With just a few commands, you’ll have a high-performance language model running securely on your own machine. And as you grow more comfortable with the setup, you can extend it into full applications, integrate it with APIs, or even build retrieval-augmented generation pipelines.
Ready to take the next step? You now have everything you need to bring DeepSeek R1 into your local development environment.
