Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
271.
Infra for building multimodal embeddings, built in Rust for speed and robustness (github.com/StarlightSearch)
1 point
Sonam_AI
2 years ago
1 comment
272.
Guiding Instruction-Based Image Editing via Multimodal Large Language Models (github.com/tsujuifu)
1 point
andsoitis
2 years ago
1 comment
273.
A framework to enable multimodal models to operate a computer (github.com/OthersideAI)
1 point
pyinstallwoes
2 years ago
1 comment
274.
Go MultiModule Workspaces:The Easy Way to Build and Run Code in Multiple Modules (github.com/mobiledatabooks)
1 point
thstart
4 years ago
1 comment
275.
Show HN: imgp – multicore batch image resizer and rotator. Go crunch 'em (github.com/jarun)
1 point
apjana
9 years ago
1 comment
276.
Bucardo multimaster and master/slave Postgres replication (github.com/bucardo)
1 point
gnocchi
11 years ago
discuss
277.
Publication under FOSS licence of a multimodal journey planner (github.com/CanalTP)
1 point
tristramg
12 years ago
discuss
278.
Multimedia story telling for the web (github.com/codevise)
1 point
trutz
12 years ago
discuss
279.
Show HN: Gemini Omni – A curated list of native multimodal guides and showcases (github.com/cnemri)
1 point
cnemri
12 days ago
discuss
280.
Show HN: Reverse lookup XKCD comics using Gemini multimodal embeddings (github.com/hemanth)
1 point
init0
2 months ago
discuss
281.
Show HN: Pixrep – Turn code repositories into PDFs for multimodal LLMs (github.com/TingjiaInFuture)
1 point
TingjiaInFuture
4 months ago
discuss
282.
MiRAGE: Open-source framework for multimodal RAG evaluation
1 point
mmhetric
4 months ago
discuss
283.
Puma 3D Printed Multimodality Microscope (github.com/TadPath)
1 point
o4c
4 months ago
discuss
284.
Show HN: X-AnyLabeling – An open-source multimodal annotation ecosystem for CV (github.com/CVHub520)
1 point
CVHub520
6 months ago
discuss
285.
Show HN: Unisondb A open source streaming multimodal database for Edge Computing (github.com/ankur-anand)
1 point
ankuranand
7 months ago
discuss
286.
Multicloud app that includes DePIN (Demo) (github.com/dkloudio)
1 point
hkdb
10 months ago
discuss
287.
Neuralink Open Sources Data Catalog for Multimodal Data (github.com/neuralinkcorp)
1 point
skadamat
a year ago
discuss
288.
Qwen2.5-Omni is an end-to-end multimodal model (github.com/QwenLM)
1 point
tosh
a year ago
discuss
289.
Aana SDK, a framework for building AI enabled multimodal applications (github.com/mobiusml)
1 point
omneity
a year ago
discuss
290.
Show HN: Kfe – Cross-Platform Search Engine for Local Multimedia Files (github.com/Fl0k3n)
1 point
flok3n
a year ago
discuss
291.
Show HN: Magnitude – Natural language E2E testing with multimodal LLM agents (github.com/magnitudedev)
1 point
thrgreenwald
a year ago
discuss
292.
Full multimodal Android llm app running without netowrk (github.com/alibaba)
1 point
juude
a year ago
discuss
293.
AtomicRing; Fast MultiCast and Single Consumer Lock-Free Queues (github.com/rezabrizi)
1 point
rezatabrizi
a year ago
discuss
294.
Mfsync: Encrypted local filesharing using multicast host lookup (github.com/k4lipso)
1 point
kalipso
a year ago
discuss
295.
DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf] (github.com/deepseek-ai)
1 point
limoce
a year ago
discuss
296.
Show HN: AnyModal – A Flexible Multimodal Language Model Framework for PyTorch (github.com/ritabratamaiti)
1 point
anneta
2 years ago
discuss
297.
Aria: Open Multimodal Native Moe (github.com/rhymes-ai)
1 point
simonpure
2 years ago
discuss
298.
Eagle: Vision-Centric High-Resolution Multimodal LLMs with Mixture of Encoders (github.com/NVlabs)
1 point
taikon
2 years ago
discuss
299.
Show HN: Multimodal PDF extraction using Sonnet 3.5/GPT-4o (github.com/graphlit)
1 point
kirkmarple
2 years ago
discuss
300.
LLM, MultiModal, and Agent Tools for ComfyUI (github.com/get-salt-AI)
1 point
sabrina_ramonov
2 years ago
discuss
More