Merge "sf: optimize luma sampling code" into qt-r1-dev