* Country and remote work preferences
* Employer type (e.g., startup, corporation, government)
* Industry
* Technologies used
* Role type (developer, architect, product owner, etc.)
* Salary range (where available)
One of the superpowers of LLMs is reformatting information from any format X to any other format Y. We leverage this to map all the unstructured job postings into the same unified structure. The new GPT functions feature and the extended context windows are really helpful for this. Instead of having to build a custom NER pipeline, it works very well with GPT out-of-the box.
One challenge is keeping the filters consistent and merging of duplicates. Embeddings help with that.
What's next:
* Integrate additional sources. We can generate web scrapers and data processing steps on the fly that extract and transform data into the same structure.
* Add location distance filters.
* Expand beyond jobs to monitor personalized data like events or real estate. Imagine using AI to rate local events from multiple sources based on your preferences, considering factors like your interests and distance from home.
* Smaller improvements based on your feedback :)