The milestone of receiving audio from the ai pipeline has been reached. It needs fiurther optimization for speed but end to end works! After replcaing the B2BUA last further work ensued to get a stasis application to orchestrate audio fro the user and direct it into the kubernetes cluster with the GPU. After tweaking the python code I founf success to something usable. The key issues is chunking, sentence formation, silence and user talk and ai response to make the experience meaningful. Next week would be wiring in short/long term memory to keep conversations context aware….
Leave a Reply