2) Google has a Data Loss Prevention API (https://cloud.google.com/dlp/), but it requires uploading your sensitive data. Is that a concern for you? Do you know of better or cheaper alternatives?
3) Sensitive data is usually company-specific. For example, mine was biotech-specific (chemical and enzyme names). How do you prevent leaks of your custom data?