Merge "Tuning of buffer dequeue code to reduce stalls" into pi-dev