Hi! I built an augmentation tool for biomedical texts to help reduce repetition in ML datasets by targeting species and strain names. The tool has been tested with NER, and there are also two datasets available: https://github.com/tznurmin/TEA_curated_data
Hope you find it useful!
Show HN: Text augmentation tool for biomedical texts | Heykuki News