AICI is based on Wasm and is designed to be fast (it runs on the CPU while the GPU is busy), secure (it can run in multi-tenant cloud deployments), and flexible (it allows libraries like Guidance, LMQL, Outlines, etc. to work on top of it).
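To make the general idea of controlling generation concrete, here is a minimal sketch of one decoding step in which a controller restricts which tokens may be chosen. This is not the AICI interface: the `Controller` trait, the `EvenOnly` toy controller, and the `constrained_step` function are hypothetical names invented for illustration, and greedy argmax stands in for whatever sampling strategy a real runtime would use.

```rust
/// Hypothetical controller interface for illustration; NOT the actual AICI API.
trait Controller {
    /// For each token id in the vocabulary, says whether it may be chosen next.
    fn allowed_tokens(&mut self, generated: &[usize], vocab_size: usize) -> Vec<bool>;
}

/// Toy controller (an assumption for the example) that only permits even token ids.
struct EvenOnly;

impl Controller for EvenOnly {
    fn allowed_tokens(&mut self, _generated: &[usize], vocab_size: usize) -> Vec<bool> {
        (0..vocab_size).map(|id| id % 2 == 0).collect()
    }
}

/// Sets the logits of disallowed tokens to negative infinity.
fn apply_mask(logits: &mut [f32], mask: &[bool]) {
    for (logit, &ok) in logits.iter_mut().zip(mask) {
        if !ok {
            *logit = f32::NEG_INFINITY;
        }
    }
}

/// Greedy pick: index of the largest finite logit, or None if every token is masked out.
fn argmax(logits: &[f32]) -> Option<usize> {
    logits
        .iter()
        .enumerate()
        .filter(|(_, l)| l.is_finite())
        .max_by(|a, b| a.1.total_cmp(b.1))
        .map(|(i, _)| i)
}

/// One decoding step: ask the controller for a mask, apply it, then pick a token.
fn constrained_step(
    ctrl: &mut dyn Controller,
    generated: &[usize],
    mut logits: Vec<f32>,
) -> Option<usize> {
    let mask = ctrl.allowed_tokens(generated, logits.len());
    apply_mask(&mut logits, &mask);
    argmax(&logits)
}
```

Calling `constrained_step(&mut EvenOnly, &[], vec![0.1, 5.0, 2.0, 9.0])` selects among token ids 0 and 2 only, even though token 3 has the highest raw logit. In this sketch the mask computation is plain CPU work that is independent of the model's forward pass, which is the kind of work that could in principle overlap with GPU computation.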
We (Microsoft Research) released it recently and would love feedback on the design of the interface, as well as on our Rust AICI runtime.
I'm the lead developer on this project and am happy to answer any questions!