I recently had an on site at Google for a Systems SRE position. I did not get the job and the feedback that my recruiter gave me was that the hiring committee found my performance in the 'troubleshooting' interview to be lacking. I guess nothing can beat the experience of actually dealing with outages but I was wondering if anyone has suggestions on additional resources for this kind of interview.The one's that I'm aware of:
* The troubleshooting chapter in the Google SRE book
* https://jvns.ca/zines/
* http://www.brendangregg.com/linuxperf.html
* Reading about past outages: https://github.com/danluu/post-mortems
Thank you very much!