Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
451.
VisionOS Godot Engine support merged
1 point
iFire
a year ago
discuss
452.
Show HN: Vision AI Label Studio – Open-Source Image Labeling Tool (vailabel.com)
1 point
vicheanath
a year ago
discuss
453.
Show HN: OSS AI Agent for Computer Vision (github.com/picselliahq)
1 point
thibautlucas
a year ago
discuss
454.
[Google Research] Handwriting Conversion with Vision Language Model (github.com/google-research)
1 point
moatmoat
a year ago
discuss
455.
Show HN: Vision, PDF reading and Python (github.com/ilevd)
1 point
ilevd
a year ago
discuss
456.
Computer vision models inference directly on mobile (github.com/software-mansion)
1 point
mrys
a year ago
discuss
457.
DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf] (github.com/deepseek-ai)
1 point
limoce
a year ago
discuss
458.
OpenDAL Going to Set Vision as "One Layer, All Storage" (github.com/apache)
1 point
xuanwo
2 years ago
discuss
459.
Show HN: Capd – idea to visually analyze active PowerShell with OpenAI Vision (github.com/Lywald)
1 point
anon012012
2 years ago
discuss
460.
Roboflow Notebooks: 60+ computer vision modeling notebooks (github.com/roboflow)
1 point
zerojames
2 years ago
discuss
461.
Eagle: Vision-Centric High-Resolution Multimodal LLMs with Mixture of Encoders (github.com/NVlabs)
1 point
taikon
2 years ago
discuss
462.
Unibench: Vision-Language Model Evaluation (github.com/facebookresearch)
1 point
zerojames
2 years ago
discuss
463.
Try to dump traditional mouse. Click by [Vim] + [screen vision-recognition] way (github.com/garywill)
1 point
gry_gh
2 years ago
discuss
464.
Show HN: Gesture Composer for VisionOS [video] (youtube.com)
1 point
nthState
2 years ago
discuss
465.
Moondream: Tiny Vision Language Model (github.com/vikhyat)
1 point
zerojames
2 years ago
discuss
466.
Show HN: Geniusrise – open-source inference endpoints for text, vision, audio (github.com)
1 point
ixaxaar
2 years ago
discuss
467.
Show HN: Building WebApp with Vision Pro Like UI with CSS (github.com/kelvinkoko)
1 point
kelvinko
2 years ago
discuss
468.
3D Printing Failure Detection with GPT4 Vision (github.com/myrakrusemark)
1 point
miduil
2 years ago
discuss
469.
SeeAct GPT-4V(ision) Is a Generalist Web Agent, If Grounded (github.com/OSU-NLP-Group)
1 point
r_singh
2 years ago
discuss
470.
AI Employe: Actions Augmented Browser Automation Using GPT-4 Vision (github.com/vignshwarar)
1 point
vignesh_warar
2 years ago
discuss
471.
Show HN: Labelformat now supports all major vision labeling formats (github.com/lightly-ai)
1 point
isusmelj
3 years ago
discuss
472.
Sound and Vision – Video Streaming to the ESP32 (github.com/atomic14)
1 point
iamflimflam1
3 years ago
discuss
473.
Large Language-and-Vision Assistant for BioMedicine (github.com/microsoft)
1 point
yagizdegirmenci
3 years ago
discuss
474.
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection (github.com/facebookresearch)
1 point
teleforce
3 years ago
discuss
475.
VoxelGPT: Open-source AI assistant for curating computer vision datasets (github.com/voxel51)
1 point
sickeythecat
3 years ago
discuss
476.
A general representation modal across vision, audio, language modalities (github.com/OFA-Sys)
1 point
logikblok
3 years ago
discuss
477.
DINOv2: Computer Vision Foundation Model by Meta AI Is on GitHub as CC-by-NC 4.0 (github.com/facebookresearch)
1 point
ftxbro
3 years ago
discuss
478.
Open Source FiftyOne Computer Vision Data and Models (github.com/voxel51)
1 point
sickeythecat
4 years ago
discuss
479.
Show HN: Don't be drowsy Computer Vision and Twilio (github.com/Curovearth)
1 point
Swarup_Tripathy
4 years ago
discuss
480.
Self-driving car computer vision adapted for UI automation (github.com/askui)
1 point
MenesJo
4 years ago
discuss
More