Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
451.
▲
VisionOS Godot Engine support merged
1 point
iFire
a year ago
discuss
452.
▲
Show HN: Vision AI Label Studio – Open-Source Image Labeling Tool
(vailabel.com)
1 point
vicheanath
a year ago
discuss
453.
▲
Show HN: OSS AI Agent for Computer Vision
(github.com/picselliahq)
1 point
thibautlucas
a year ago
discuss
454.
▲
[Google Research] Handwriting Conversion with Vision Language Model
(github.com/google-research)
1 point
moatmoat
a year ago
discuss
455.
▲
Show HN: Vision, PDF reading and Python
(github.com/ilevd)
1 point
ilevd
a year ago
discuss
456.
▲
Computer vision models inference directly on mobile
(github.com/software-mansion)
1 point
mrys
a year ago
discuss
457.
▲
DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf]
(github.com/deepseek-ai)
1 point
limoce
a year ago
discuss
458.
▲
OpenDAL Going to Set Vision as "One Layer, All Storage"
(github.com/apache)
1 point
xuanwo
2 years ago
discuss
459.
▲
Show HN: Capd – idea to visually analyze active PowerShell with OpenAI Vision
(github.com/Lywald)
1 point
anon012012
2 years ago
discuss
460.
▲
Roboflow Notebooks: 60+ computer vision modeling notebooks
(github.com/roboflow)
1 point
zerojames
2 years ago
discuss
461.
▲
Eagle: Vision-Centric High-Resolution Multimodal LLMs with Mixture of Encoders
(github.com/NVlabs)
1 point
taikon
2 years ago
discuss
462.
▲
Unibench: Vision-Language Model Evaluation
(github.com/facebookresearch)
1 point
zerojames
2 years ago
discuss
463.
▲
Try to dump traditional mouse. Click by [Vim] + [screen vision-recognition] way
(github.com/garywill)
1 point
gry_gh
2 years ago
discuss
464.
▲
Show HN: Gesture Composer for VisionOS [video]
(youtube.com)
1 point
nthState
2 years ago
discuss
465.
▲
Moondream: Tiny Vision Language Model
(github.com/vikhyat)
1 point
zerojames
2 years ago
discuss
466.
▲
Show HN: Geniusrise – open-source inference endpoints for text, vision, audio
(github.com)
1 point
ixaxaar
2 years ago
discuss
467.
▲
Show HN: Building WebApp with Vision Pro Like UI with CSS
(github.com/kelvinkoko)
1 point
kelvinko
2 years ago
discuss
468.
▲
3D Printing Failure Detection with GPT4 Vision
(github.com/myrakrusemark)
1 point
miduil
2 years ago
discuss
469.
▲
SeeAct GPT-4V(ision) Is a Generalist Web Agent, If Grounded
(github.com/OSU-NLP-Group)
1 point
r_singh
2 years ago
discuss
470.
▲
AI Employe: Actions Augmented Browser Automation Using GPT-4 Vision
(github.com/vignshwarar)
1 point
vignesh_warar
2 years ago
discuss
471.
▲
Show HN: Labelformat now supports all major vision labeling formats
(github.com/lightly-ai)
1 point
isusmelj
3 years ago
discuss
472.
▲
Sound and Vision – Video Streaming to the ESP32
(github.com/atomic14)
1 point
iamflimflam1
3 years ago
discuss
473.
▲
Large Language-and-Vision Assistant for BioMedicine
(github.com/microsoft)
1 point
yagizdegirmenci
3 years ago
discuss
474.
▲
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
(github.com/facebookresearch)
1 point
teleforce
3 years ago
discuss
475.
▲
VoxelGPT: Open-source AI assistant for curating computer vision datasets
(github.com/voxel51)
1 point
sickeythecat
3 years ago
discuss
476.
▲
A general representation modal across vision, audio, language modalities
(github.com/OFA-Sys)
1 point
logikblok
3 years ago
discuss
477.
▲
DINOv2: Computer Vision Foundation Model by Meta AI Is on GitHub as CC-by-NC 4.0
(github.com/facebookresearch)
1 point
ftxbro
3 years ago
discuss
478.
▲
Open Source FiftyOne Computer Vision Data and Models
(github.com/voxel51)
1 point
sickeythecat
4 years ago
discuss
479.
▲
Show HN: Don't be drowsy Computer Vision and Twilio
(github.com/Curovearth)
1 point
Swarup_Tripathy
4 years ago
discuss
480.
▲
Self-driving car computer vision adapted for UI automation
(github.com/askui)
1 point
MenesJo
4 years ago
discuss
More