deepseek-ocr

Here are 25 public repositories matching this topic...

oomol-lab / pdf-craft

PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.

pdf ocr document deepseek-ocr

Updated Dec 10, 2025
Python

magicyuan876 / mineru-tianshu

Star

pdf-converter markitdown mineru mcp-server deepseek-ocr paddleocr-vl

Updated Dec 10, 2025
Python

th1nhhdk / local_ai_ocr

Star

An local, offline (after initial setup), portable OCR software that can process images and PDF files, using DeepSeek-OCR AI (running directly on your machine).

multilingual ocr ai local offline portable vietnamese english multilanguage llm vllm deepseek-ocr

Updated Dec 9, 2025
Python

MaxDevv / Un-LOCC

Star

Un-LOCC: Universal Lossy Optical Context Compression for Vision-Based Language Models Achieve nearly 3x token compression at over 93% retrieval accuracy using existing Vision-Language Models.

compression ai vision-language-model long-context-llm deepseek-ocr

Updated Oct 24, 2025
Python

athrael-soju / Snappy

Star

🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐

Updated Dec 9, 2025
Python

Cross2pro / DeepSeek-OCR-Dashboard

Star

An out-of-the-box local Web UI for DeepSeek-OCR. Built with FastAPI + Vue.js, it supports PDF/Image uploads, progress tracking, and result visualization with bounding boxes. Easily experience the power of a top-tier OCR model.

ocr computer-vision text-recognition research-tool image-to-text optical-character-recognition document-analysis multimodal math-ocr latex-ocr llm large-language-model pdf-ocr deepseek deepseek-ocr ocr-webservice

Updated Dec 6, 2025
Python

BukeLy / rag-api

Star

Multi-tenant RAG API powered by LightRAG/RAG-Anything. Auto-selects best parser (DeepSeek-OCR/MinerU/Docling) via complexity scoring

python multi-tenant ai knowledge-graph semantic-search rag fastapi qdrant retrieval-augmented-generation mineru lightrag deepseek-ocr

Updated Dec 2, 2025
Python

reuAC / Tools_DeepSeekOCR

Star

A Windows-based screenshot OCR utility powered by DeepSeek-OCR. This tool allows users to quickly capture screen regions and perform high-accuracy Optical Character Recognition (OCR) directly on the captured image, leveraging the powerful DeepSeek-OCR model. It supports local model deployment and features real-time model output streaming.

python windows real-time ocr ai computer-vision gpu-acceleration ocr-python screenshot-tool deepseek-ocr

Updated Oct 21, 2025
Python

athrael-soju / little-scripts

Star

A monorepo containing various utility scripts, tools, and applications for development, automation, and AI-powered tasks.

text-to-speech ocr computer-vision cuda speech-to-text gradio fastapi vector-search document-understanding qdrant paddle-ocr flash-attention rag-chatbot colpali deepseek-ocr

Updated Nov 30, 2025
Python

PRITHIVSAKTHIUR / Super-OCRs-Demo

Star

A Gradio-based demo application for comparing state-of-the-art OCR models: DeepSeek-OCR, Dots.OCR, HunyuanOCR, and Nanonets-OCR2-3B.

python ocr pillow torch accelerate supervision gradio opencv-python nanonets torchvision sentencepiece huggingface-transformers huggingface-spaces flash-attention-2 hunyuan qwen2-5-vl dots-ocr deepseek-ocr easydict

Updated Nov 28, 2025
Python

neosantara-xyz / deepseek-ocr-api

Star

Host your own DeepSeek OCR in easy way through modal serverless compute

inference-engine inference-api deepseek-api deepseek-ocr

Updated Nov 12, 2025
Python

MichaelWalker-git / deepseek_ocr

Star

Self hosting your own DeepSeek OCR model in AWS

self-hosted llm deepseek-ocr

Updated Nov 21, 2025
Python

aws-samples / sample-deepseek-ocr-selfhost

Star

Here is a way to self host the Deep Seek OCR model in AWS without Bedrock. This allows for you to run OCR jobs at the scale you need without the limits of token costs.

ocr self-host llm deepseek-ocr

Updated Nov 19, 2025
Python

PRITHIVSAKTHIUR / DeepSeek-OCR-experimental

Star

A Gradio-powered web interface for performing advanced OCR tasks using the DeepSeek-OCR model. This experimental app leverages Hugging Face Transformers to process images for text extraction, document conversion, figure parsing, and object localization.

torch pytorch accelerate matplotlib addict gradio opencv-python torchvision i64 huggingface-transformers timm bf16 huggingface-spaces flash-attention-2 deepseek-ocr easydict

Updated Nov 4, 2025
Python

Moenupa / DeOCR

Star

A high-performance highly-customizable reverse OCR tool that renders text or huggingface-compatible datasets to images. Dimension, DPI, CSS configurable!

python ocr asynchronous data-preprocessing image-rendering multimodal huggingface-datasets multimodal-large-language-models mllm-evaluation deepseek-ocr