Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
31.
▲
Show HN: Open-Source Document Extraction Tool
(github.com/harishdeivanayagam)
13 points
harishd30
a year ago
2 comments
32.
▲
Show HN: AREnets – TensorFlow-based Relation Extraction kit for work in Colab
(github.com/nicolay-r)
10 points
nicolay-r
3 years ago
discuss
33.
▲
genstring string extraction tool for Swift – iOS
(github.com/KeepSafe)
9 points
philippb
11 years ago
discuss
34.
▲
Show HN: Sculptor – Python library for LLM structured data extraction (MIT)
(github.com/lightning-rod-labs)
9 points
bturtel
a year ago
discuss
35.
▲
Unstract: Open-source platform to ship document extraction APIs in minutes
(github.com/Zipstack)
8 points
naren87
10 months ago
1 comment
36.
▲
Show HN: Camelot – PDF Table Extraction for Humans
(github.com/socialcopsdev)
8 points
vortex_ape
8 years ago
discuss
37.
▲
Show HN: Structured HTML table data extraction from URLs in Go
(github.com/nfx)
7 points
nf-x
4 years ago
2 comments
38.
▲
Show HN: XTractor, a simple heuristics-based webpage text extraction demo
(github.com/mohaps)
7 points
mohaps
11 years ago
discuss
39.
▲
Show HN: Extractous, Rust based data extraction ~25x faster than unstructured-io
(github.com/yobix-ai)
6 points
nmammeri
2 years ago
discuss
40.
▲
Benchmark for Audio Feature Extraction Libraries
(github.com/libAudioFlux)
6 points
james0517
3 years ago
discuss
41.
▲
Show HN: Chopper – Easy HTML/CSS Extraction with Python
(github.com/jurismarches)
5 points
Socketubs
11 years ago
discuss
42.
▲
Show HN: Kreuzberg v3.0 – Modern Python Document Extraction
5 points
nhirschfeld
a year ago
discuss
43.
▲
GCC/gcov code coverage data extraction from the target, without FS, OS, or Libc
(github.com/nasa-jpl)
5 points
gsempe
4 years ago
discuss
44.
▲
ExtractNet: Open-source ML based content extraction
(github.com/currentsapi)
5 points
polymorph1sm
5 years ago
discuss
45.
▲
Show HN: Multimodal Search Using GPT4o for Metadata Extraction and Hybrid Index
(github.com/pathwaycom)
4 points
janchorowski
2 years ago
2 comments
46.
▲
Agentic Doc: Agentic Data Extraction from Visually Complex Documents
(github.com/landing-ai)
4 points
yanng404
a year ago
discuss
47.
▲
XML/HTML Extraction and Removal on Nim. Features Nim Excellences
(github.com/abdulbadii)
4 points
mardiyah
4 years ago
discuss
48.
▲
Improved query fields extraction helper for GraphQL
(github.com/Mikhus)
3 points
MykhailoStadnyk
8 years ago
2 comments
49.
▲
AudioFlux: A library for audio and music analysis, feature extraction
(github.com)
3 points
CMLab
3 years ago
1 comment
50.
▲
Newspaper is a tool for news extraction and curation in Python
(github.com/codelucas)
3 points
ekianjo
12 years ago
discuss
51.
▲
Show HN: Newspaper, simple news extraction and curation in python
(github.com/codelucas)
3 points
louyang
12 years ago
discuss
52.
▲
Unstract: Open-source platform to ship document extraction APIs in minutes
(github.com/Zipstack)
3 points
naren87
9 months ago
discuss
53.
▲
Show HN: Easy data extraction from text with Pydantic and OpenAI
(github.com/jiggy-ai)
3 points
wskish
3 years ago
discuss
54.
▲
Mask ROM Extraction
(github.com/travisgoodspeed)
3 points
picture
3 years ago
discuss
55.
▲
Automatic time series feature extraction based on scalable hypothesis tests
(github.com/blue-yonder)
3 points
batterylow
5 years ago
discuss
56.
▲
Tsfresh: Automatic extraction of relevant features from time series
(github.com/blue-yonder)
3 points
pjf
7 years ago
discuss
57.
▲
Using AWS Lambda for fast OCR text extraction (and non OCR too)
(github.com/skylander86)
3 points
skylander
9 years ago
discuss
58.
▲
Show HN: Knowledge Table – Explainable multi-document extraction
(github.com/whyhow-ai)
2 points
tomsmoker
2 years ago
1 comment
59.
▲
GPT-based ontological extraction tools, including SPIRES
(github.com/monarch-initiative)
2 points
gardenfelder
3 years ago
1 comment
60.
▲
DEDA – Tracking Dots Extraction, Decoding and Anonymisation Toolkit
(github.com/dfd-tud)
2 points
d4a
4 years ago
1 comment
More