Audio transcription system, where we can use artificial intelligence to transcribe audio and send it in the conversation as a message