GNU Multiprecision (GMP) cares a lot about performance (and correctness). As such, there's a subsystem which uses C macros to wrap various APIs and capture performance metrics. The core "cycle-counting" code is implemented in assembly for a variety of CPUs, so that the measurement itself adds as little overhead as possible and doesn't generate misleading numbers.
There are some no-ops implemented, presumably to test this infrastructure.
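To make the idea concrete, here's a minimal sketch of macro-based timing with a no-op baseline. It is not GMP's code: `TIME_CALL`, `noop`, `sum_to` and `measure_net_ticks` are hypothetical names, and the standard `clock()` stands in for the hand-written assembly cycle counters.

```c
#include <assert.h>
#include <stdint.h>
#include <time.h>

/* Hypothetical macro: run `call` `reps` times and store the elapsed
   processor time in `elapsed`. Assumption: clock() stands in for a
   real CPU cycle counter, which would be far more precise. */
#define TIME_CALL(elapsed, reps, call)                    \
    do {                                                  \
        clock_t t0_ = clock();                            \
        for (unsigned i_ = 0; i_ < (reps); i_++) {        \
            call;                                         \
        }                                                 \
        (elapsed) = clock() - t0_;                        \
    } while (0)

/* No-op with the same signature as the routine under test. An
   optimizing compiler may inline it away, in which case the baseline
   reflects only loop and timer overhead. */
static void noop(unsigned n) { (void)n; }

/* Volatile sink so the work below is not optimized out. */
static volatile uint64_t sink;

/* Toy routine to measure: sums 1..n. */
static void sum_to(unsigned n)
{
    uint64_t s = 0;
    for (unsigned i = 1; i <= n; i++)
        s += i;
    sink = s;
}

/* Time the routine, then subtract the no-op baseline (clamped at 0). */
static clock_t measure_net_ticks(unsigned reps, unsigned n)
{
    clock_t overhead, total;
    assert(reps > 0);
    TIME_CALL(overhead, reps, noop(n));
    TIME_CALL(total, reps, sum_to(n));
    return total > overhead ? total - overhead : 0;
}
```

Calling `measure_net_ticks(100000, 1000)` gives a rough tick count for the toy routine with the baseline removed. The coarse resolution of `clock()` is exactly the kind of problem a dedicated cycle counter avoids.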
There are utilities built around this to, e.g., try out a range of magic numbers and find the optimal one.
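Trying out magic numbers can be sketched as a simple sweep over candidates, keeping whichever one costs least. This is an illustration under assumptions, not how GMP's tuner actually works: `best_threshold` and `synthetic_cost` are hypothetical names, and the cost function here is a made-up curve where a real tuner would plug in a timing measurement.

```c
#include <assert.h>

/* Cost of running with a given threshold; lower is better. In a real
   tuner this would be a timing measurement. */
typedef double (*cost_fn)(int threshold);

/* Try every candidate in [lo, hi] at the given step and return the
   one with the lowest cost. Ties keep the earliest candidate. */
static int best_threshold(int lo, int hi, int step, cost_fn cost)
{
    assert(lo <= hi && step > 0);
    int best = lo;
    double best_cost = cost(lo);
    for (int t = lo + step; t <= hi; t += step) {
        double c = cost(t);
        if (c < best_cost) {
            best_cost = c;
            best = t;
        }
    }
    return best;
}

/* Made-up cost curve with its minimum at 24, for demonstration only. */
static double synthetic_cost(int threshold)
{
    double d = (double)(threshold - 24);
    return d * d;
}
```

With the synthetic curve, `best_threshold(8, 64, 8, synthetic_cost)` picks 24. Real timings are noisy, so an actual tuner would likely need repeated measurements per candidate rather than a single sample.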