We created Piglet, a code generator and compiler that produces Spark or Flink jobs from Pig Latin input scripts: https://github.com/dbis-ilm/piglet/

I'm interested in real-world Pig scripts and input data to test the implementation. Is anyone actively using Apache Pig?