Mining LLM Pre-Training Data from Codebases | Heykuki News