Search: github.com/Lision | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

361.

Show HN: Slideo – Synchronize Slides with Video Using Computer Vision (OpenCV) (github.com/hediet)

2 points

5 years ago

362.

pip3 install videoflow - New library for computer vision on videos

2 points

7 years ago

363.

JavaScript Computer Vision library. (inspirit.github.com)

2 points

13 years ago

364.

Show HN: SoMatic – Vision-based OS automation framework for AI agents (github.com/Smyan1909)

2 points

16 days ago

365.

Show HN: Neuroscope – Real-time “x-ray vision” into LLMs’ minds (github.com/cjroth)

2 points

3 months ago

366.

Alibaba releases open-source vision model for native layered image editing (github.com/QwenLM)

2 points

6 months ago

367.

Yzma – local Vision Language Models/LLMs in Go using llama.cpp without CGo (github.com/hybridgroup)

2 points

8 months ago

368.

Show HN: Magnitude MCP – vision-first browser interaction for Claude Code (github.com/sagekit)

2 points

8 months ago

369.

Show HN: Demo of AI-enabled voice/vision features on open source hardware [video] (youtube.com)

2 points

9 months ago

370.

Show HN: Plug-and-play Python utils for any computer-vision pipeline (github.com/roboflow)

2 points

a year ago

371.

Show HN: I achieved over 10% improvement on 3D vision PointCLIP (github.com/genji970)

2 points

a year ago

372.

Smolvlm – Realtime Vision Language Model Demo (github.com/ngxson)

2 points

a year ago

373.

Search images like text using Vision Language Models (github.com/StarlightSearch)

2 points

a year ago

374.

OmniTool – Control a Windows 11 VM with OmniParser plus vision model of choice (github.com/microsoft)

2 points

a year ago

375.

Sparrow: Open-source data processing with ML, LLM and Vision LLM (github.com/katanaml)

2 points

a year ago

376.

Visual Product Search: Combining React Native, Cloud Vision, Algolia, and Remix

2 points

a year ago

377.

ShowUI: A lightweight vision-language-action model for GUI agents (github.com/showlab)

2 points

2 years ago

378.

BiomedGPT: A Generalist Vision-Language Foundation Model for Biomedical Tasks (github.com/taokz)

2 points

giuliomagnifico

2 years ago

379.

Mini-Omni2: Towards Open-Source GPT-4o with Vision, Speech, Duplex Capabilities (github.com/gpt-omni)

2 points

2 years ago

380.

Ollama with Experimental Vision Support (github.com/ollama)

2 points

2 years ago

381.

Show HN: Created a notebook to compare the top LMSYS vision models easily (github.com/Portkey-AI)

2 points

2 years ago

382.

Recognize faces in photos using local models with Apple Vision (github.com/Nexuist)

2 points

2 years ago

383.

Show HN: I made a simple unified LLM client with tool calling and vision support (github.com/piEsposito)

2 points

someguy12345678

2 years ago

384.

Implementation of Google's ScreenAI: Vision-Lang Model for UI and Understanding (github.com/kyegomez)

2 points

2 years ago

385.

Apple Vision Pro and ROG Ally: Portable console gaming setup guide (gist.github.com)

2 points

2 years ago

386.

Godot Support for VisionOS (github.com/kevinw)

2 points

2 years ago

387.

TrackTales: Zero-shot narrator for mpd using GPT-4-vision (github.com/mlang)

2 points

2 years ago

388.

Moe-LLaVA: Mixture of Experts for Large Vision-Language Models (github.com/PKU-YuanGroup)

2 points

2 years ago

389.

GPT Video – Reproducing the Gemini Demo Using GPT 4 Vision (github.com/jide)

2 points

2 years ago

390.

GPT-Vision first most reliable open-source browser automation (github.com/vignshwarar)

2 points

2 years ago