Concurrency is applied per output pixel, as the output device demands, with a decently designed index used to minimize the cost of hit-testing each bounce. This same index can be reused for kinetics simulations!
Good old computer-science techniques!
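To make the "index" idea concrete, here is a minimal sketch of one such structure: a bounding volume hierarchy (BVH) over spheres. The choice of a BVH is an assumption for illustration, since the text above doesn't say which index is used, and every name here (`Sphere`, `Node`, `closest_hit`, ...) is hypothetical. The point is only that a ray skips whole groups of objects whose bounding box it misses, instead of testing every object on every bounce.

```python
import math
from dataclasses import dataclass


@dataclass
class Sphere:
    center: tuple  # (x, y, z)
    radius: float


def sphere_bounds(s):
    """Axis-aligned box (lo, hi) enclosing one sphere."""
    lo = tuple(c - s.radius for c in s.center)
    hi = tuple(c + s.radius for c in s.center)
    return lo, hi


def merge(a, b):
    """Smallest box enclosing boxes a and b."""
    return tuple(map(min, a[0], b[0])), tuple(map(max, a[1], b[1]))


class Node:
    """BVH node. Assumes a non-empty list of spheres (illustrative only)."""

    def __init__(self, spheres):
        bounds = sphere_bounds(spheres[0])
        for s in spheres[1:]:
            bounds = merge(bounds, sphere_bounds(s))
        self.bounds = bounds
        self.spheres = None
        self.left = self.right = None
        if len(spheres) <= 2:
            self.spheres = spheres  # leaf
        else:
            # Split at the median along the widest axis.
            extent = [hi - lo for lo, hi in zip(*bounds)]
            axis = extent.index(max(extent))
            ordered = sorted(spheres, key=lambda s: s.center[axis])
            mid = len(ordered) // 2
            self.left = Node(ordered[:mid])
            self.right = Node(ordered[mid:])


def ray_hits_box(origin, direction, bounds):
    """Slab test: does the ray (t >= 0) pass through the box?"""
    t_near, t_far = 0.0, math.inf
    for o, d, lo, hi in zip(origin, direction, bounds[0], bounds[1]):
        if d == 0.0:
            if o < lo or o > hi:
                return False  # parallel to this slab and outside it
            continue
        t0, t1 = (lo - o) / d, (hi - o) / d
        if t0 > t1:
            t0, t1 = t1, t0
        t_near, t_far = max(t_near, t0), min(t_far, t1)
        if t_near > t_far:
            return False
    return True


def ray_hits_sphere(origin, direction, s):
    """Distance to the nearest hit in front of the origin, or None.
    Assumes `direction` has unit length."""
    oc = tuple(o - c for o, c in zip(origin, s.center))
    b = sum(x * d for x, d in zip(oc, direction))
    c = sum(x * x for x in oc) - s.radius ** 2
    disc = b * b - c
    if disc < 0:
        return None
    root = math.sqrt(disc)
    for t in (-b - root, -b + root):
        if t > 1e-9:
            return t
    return None


def closest_hit(node, origin, direction):
    """Return (t, sphere) for the nearest hit, or None on a miss."""
    if not ray_hits_box(origin, direction, node.bounds):
        return None  # the whole subtree is skipped
    if node.spheres is not None:
        candidates = []
        for s in node.spheres:
            t = ray_hits_sphere(origin, direction, s)
            if t is not None:
                candidates.append((t, s))
    else:
        children = (closest_hit(node.left, origin, direction),
                    closest_hit(node.right, origin, direction))
        candidates = [h for h in children if h is not None]
    return min(candidates, key=lambda h: h[0], default=None)
```

Build the tree once with `Node(list_of_spheres)`, then call `closest_hit(root, origin, unit_direction)` for each pixel's ray and each bounce. The tree is read-only during rendering, which is what makes the per-pixel concurrency easy. A real renderer would likely use a smarter split heuristic and test the nearer child first; this sketch keeps the plain median split for brevity.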
1.1/1.1 Fin for today! Tomorrow: Modern GPU design.