01 / 09

SEEKDEEP · v—

SeekDeep

A local-first Discord assistant

NATHANNEUROTIC / SEEKDEEP-DISCORDBOT · 2026

02 / 09

THE PROBLEM

▸ EVERY OTHER DISCORD AI BOT

Your prompts leave the room.

PROBLEM 01

Cloud-only

Every message hits OpenAI, Anthropic, or some third-party API. Your conversation is now their training data — or their incident log.

PROBLEM 02

Pay per token

Heavy users hit rate limits, hit billing walls, or get throttled when usage spikes. The bot's silent when you need it most.

PROBLEM 03

One model fits none

The same GPT call handles chat, code, image prompts, vision — even though no single model is best at all four.

WHY SEEKDEEP · 02

03 / 09

THE SOLUTION

▸ THREE PILLARS

Run it on your GPU.

PILLAR 01

Local

Llama-3.1, Qwen-VL, SDXL all run on your card. Zero outbound API calls. Chat, image, archive, search history — all on the box that owns the silicon.

PILLAR 02

Routed

5 chat roles — default, quality, reasoning, fallback, lightweight. The router picks the right model for the task. Translation goes to Gemma-3n. Code goes to Phi-4.
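A minimal sketch of how role routing like this could look. The five role names come from the slide; the task labels, the `TASK_TO_ROLE` table, and the degrade-to-fallback rule are illustrative assumptions, not SeekDeep's actual router.

```python
from typing import Optional, Set

# Role names as listed on the slide.
ROLES = ("default", "quality", "reasoning", "fallback", "lightweight")

# Assumed task -> role mapping. The deck only states that translation goes to
# Gemma-3n (the lightweight role) and code goes to Phi-4 (the reasoning role);
# the "long_form" entry is purely hypothetical.
TASK_TO_ROLE = {
    "translation": "lightweight",
    "code": "reasoning",
    "long_form": "quality",
}


def pick_role(task: str, available: Optional[Set[str]] = None) -> str:
    """Return the role for a task, degrading to 'fallback', then 'default'."""
    available = set(ROLES) if available is None else available
    role = TASK_TO_ROLE.get(task, "default")
    if role in available:
        return role
    # Preferred role is not usable right now: degrade gracefully.
    return "fallback" if "fallback" in available else "default"


if __name__ == "__main__":
    print(pick_role("code"))                           # reasoning
    print(pick_role("code", {"default", "fallback"}))  # fallback
```

Keeping the mapping in a plain table means a new task type is one line, and an unknown task lands on the default role instead of raising.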

PILLAR 03

Aware

Task-LRU swaps models in and out of VRAM as needed. Thrashing warnings before OOM. /gpu watch shows it live.
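The swap-and-warn behaviour can be pictured with a toy LRU over a VRAM budget. This is a sketch under stated assumptions: the class name `TaskLRU`, the sliding-window thrash heuristic, and its thresholds are hypothetical, and no real GPU memory is touched.

```python
from collections import OrderedDict, deque


class TaskLRU:
    """Toy LRU over loaded models with a VRAM budget (all sizes in GB)."""

    def __init__(self, budget_gb, thrash_window=4, thrash_limit=3):
        self.budget_gb = budget_gb
        self.loaded = OrderedDict()  # model name -> size, least recent first
        # One entry per request: 1 if that request forced an eviction.
        self.recent_evictions = deque(maxlen=thrash_window)
        self.thrash_limit = thrash_limit

    def used_gb(self):
        return sum(self.loaded.values())

    def request(self, name, size_gb):
        """Mark a model as needed; return the models evicted to make room."""
        if size_gb > self.budget_gb:
            raise ValueError(f"{name} needs {size_gb} GB, budget is {self.budget_gb} GB")
        evicted = []
        if name in self.loaded:
            self.loaded.move_to_end(name)  # already resident: refresh recency
        else:
            while self.used_gb() + size_gb > self.budget_gb:
                victim, _ = self.loaded.popitem(last=False)  # least recently used
                evicted.append(victim)
            self.loaded[name] = size_gb
        self.recent_evictions.append(1 if evicted else 0)
        return evicted

    def thrashing(self):
        """True when most of the recent requests each forced an eviction."""
        return sum(self.recent_evictions) >= self.thrash_limit


if __name__ == "__main__":
    lru = TaskLRU(budget_gb=12.0)
    for name, size in [("llama", 5.1), ("phi", 9.1), ("llama", 5.1), ("phi", 9.1)]:
        print(name, "evicted:", lru.request(name, size), "thrashing:", lru.thrashing())
```

Alternating between two models that cannot both fit trips the warning after three evictions in a row, which is the kind of signal an operator would want before an out-of-memory failure.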

SEEKDEEP / DESIGN PRINCIPLES · 03

04 / 09

THE NUMBERS

▸ ON A 24 GB GPU

0 outbound API calls. Ever.

5 chat model roles · task-LRU swapped

8 image edit pipelines · SDXL + InstructPix2Pix + inpaint + upscale

10 image style presets · 3 quality tiers

565 smoke tests · npm run preflight in 1s

35 releases since v10.0 · v— is current

HARD NUMBERS · 04

05 / 09

ARCHITECTURE

▸ THREE PROCESSES

Simple by construction.

EXTERNAL

Discord

WebSocket gateway

→

NODE BOT

index.js

869 KB · ESM · Node v20+

PYTHON

local_ai_server.py

FastAPI · :7865 · task-LRU

→

DOCKER

SearXNG

privacy metasearch · :8080
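As a rough illustration of how the pieces address each other, the sketch below pins down the two local endpoints and builds a SearXNG query URL. The ports come from the slide; the loopback host and the helper name `searxng_search_url` are assumptions, and SearXNG only answers `format=json` when that format is enabled in its settings.

```python
from urllib.parse import urlencode

# Ports are from the slide; binding to 127.0.0.1 is an assumption.
AI_SERVER = "http://127.0.0.1:7865"  # local_ai_server.py (FastAPI)
SEARXNG = "http://127.0.0.1:8080"    # SearXNG container


def searxng_search_url(query: str) -> str:
    """Build a SearXNG search URL that asks for JSON results."""
    return f"{SEARXNG}/search?" + urlencode({"q": query, "format": "json"})


if __name__ == "__main__":
    print(searxng_search_url("local llm"))
```

Nothing here opens a connection; it only shows that everything the bot talks to, apart from Discord, sits on a local port.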

STACK · LOCAL · 05

06 / 09

MODELS · 24 GB BUDGET

VRAM math.

ROLE	MODEL	QUANT	VRAM
default_chat	Llama-3.1-8B-Instruct	4bit · NF4	5.1 GB
quality_text	Mistral-Nemo-Instruct-2407	4bit	7.2 GB
reasoning_code	microsoft/phi-4	4bit	9.1 GB
lightweight_chat	gemma-3n-E4B-it	4bit	3.1 GB
image	Dreamshaper-XL	fp16	6.8 GB
vision	Qwen2.5-VL-3B-Instruct	fp16	3.4 GB

▸ task-LRU keeps just one chat model resident · pin via LOCAL_*_KEEP_RESIDENT=on
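The arithmetic behind the table, as a small sketch. The figures are copied from the VRAM column; the assumption that the image and vision models stay loaded next to a single chat model is mine, and the sums cover weights only (no KV cache or activation overhead).

```python
# VRAM figures copied from the table (GB).
CHAT_GB = {
    "default_chat": 5.1,
    "quality_text": 7.2,
    "reasoning_code": 9.1,
    "lightweight_chat": 3.1,
}
OTHER_GB = {"image": 6.8, "vision": 3.4}
BUDGET_GB = 24.0


def all_resident_gb() -> float:
    """Total if every model in the table were loaded at once."""
    return round(sum(CHAT_GB.values()) + sum(OTHER_GB.values()), 1)


def worst_case_gb() -> float:
    """Largest chat model plus image and vision (assumed to stay loaded)."""
    return round(max(CHAT_GB.values()) + sum(OTHER_GB.values()), 1)


if __name__ == "__main__":
    print("all resident:", all_resident_gb(), "GB")
    print("one chat model:", worst_case_gb(), "GB")
    print("headroom:", round(BUDGET_GB - worst_case_gb(), 1), "GB")
```

Loading everything would need 34.7 GB, well over the 24 GB budget. With only the largest chat model resident the total drops to 19.3 GB, leaving about 4.7 GB of headroom under these assumptions.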

VRAM · 06

07 / 09

THE GUI

▸ 14 surfaces · one visual system

Designed end to end.

01 · Hub · project entry
02 · Control · operator console
03 · Chat · conversation client
04 · Installer · 9-step wizard
05 · Docs · command reference
06 · API · endpoint explorer
07 · Architecture · system diagram
08 · Roadmap · planned work
09 · Changelog · release history
10 · Memory · design preview
11 · Landing · public overview
12 · Tour · guided walkthrough
13 · Mobile · phone-size mocks
14 · Boot · startup animation

SURFACES · 07

08 / 09

ROADMAP

Where it's headed.

RECENTLY SHIPPED

v10.35 · stabilization wave · 565 smoke tests pass
v10.34 · archive integrity, image reply intent classifier
v10.31 · InstructPix2Pix + CLIPSeg inpainting
v10.25 · img2img + upscale
v10.12 · live GPU/VRAM monitoring

UP NEXT

Real-ESRGAN upscaler · scaffolded
TTS voice channel · Piper / XTTS
Persistent memory · per-user/channel JSON
SAM + GroundingDINO segmentation
NSFW gate · CLIP-based classifier

PLANNED.md · 08

09 / 09

CLOSE

Run it locally.

▸ GPL-2.0 licensed · Windows-first · ~6-minute install

Repository

github.com/NathanNeurotic/SeekDeep-DiscordBot

Version

v—

License

GPL-2.0

NATHANNEUROTIC / 2026 · END