Implement a variance weighted sampling to improve performance. Initial idea: - calculate the sum of variance of each cuda block. - Sort cuda blocks based on the variance - Pick #blocks/reduction_factor blocks to path trace. - Input the picked blocks to ray trace.