Merge "[speech] implement recognizer proxy" into main