# Detecting actual CPU count: 32 detected # JMH 1.6.3 (released 5 days ago) # VM invoker: /aurora/clredest/contended/jdk-baseline/jre/bin/java # VM options: -XX:-UseBiasedLocking -XX:+UseHeavyMonitors # Warmup: # Measurement: 1 iterations, 500 ms each, 40000000 calls per op # Timeout: 10 min per iteration # Threads: 32 threads # Benchmark mode: Single shot invocation time # Benchmark: org.openjdk.micros.contended.ContendedMonitorBench.synchronizeAndAccessContendedHeader # Parameters: (inflate = false, monitors = 8, step = 8) # Run progress: 0.00% complete, ETA 00:00:00 # Fork: 1 of 1 # Preparing profilers: perf # Profilers consume stderr from target VM, use -v EXTRA to copy to console Iteration 1: 34071505.657 ±(99.9%) 3889570.654 us/op # Processing profiler results: perf Run result: 34071505.66 us/op (<= 2 samples) Perf stats: -------------------------------------------------- 454807.194747 task-clock # 11.200 CPUs utilized 20,338,724 context-switches # 0.045 M/sec 115,283 CPU-migrations # 0.000 M/sec 14,454 page-faults # 0.000 M/sec 1,235,040,200,815 cycles # 2.716 GHz [25.11%] 937,724,171,260 stalled-cycles-frontend # 75.93% frontend cycles idle [25.04%] 754,607,171,325 stalled-cycles-backend # 61.10% backend cycles idle [24.97%] 647,227,878,080 instructions # 0.52 insns per cycle # 1.45 stalled cycles per insn [31.21%] 128,729,261,251 branches # 283.041 M/sec [31.23%] 892,349,800 branch-misses # 0.69% of all branches [31.07%] 209,690,375,343 L1-dcache-loads # 461.053 M/sec [31.15%] 7,263,377,868 L1-dcache-load-misses # 3.46% of all L1-dcache hits [31.16%] 58,354,410 LLC-loads # 0.128 M/sec [24.96%] 56,520,114 LLC-load-misses # 96.86% of all LL-cache hits [25.11%] L1-icache-loads 5,813,797,534 L1-icache-load-misses # 0.00% of all L1-icache hits [25.07%] 211,667,016,841 dTLB-loads # 465.399 M/sec [25.17%] 350,056,756 dTLB-load-misses # 0.17% of all dTLB cache hits [25.15%] 482,333,536 iTLB-loads # 1.061 M/sec [25.11%] 101,832,008 iTLB-load-misses # 21.11% of all iTLB cache hits [25.14%] L1-dcache-prefetches 732,360,566 L1-dcache-prefetch-misses # 1.610 M/sec [25.11%] 40.607187242 seconds time elapsed WARNING: Your system uses old "perf", which can not delay data collection. Therefore, perf performance data includes benchmark warmup. # JMH 1.6.3 (released 5 days ago) # VM invoker: /aurora/clredest/contended/jdk-baseline/jre/bin/java # VM options: -XX:-UseBiasedLocking -XX:+UseHeavyMonitors # Warmup: # Measurement: 1 iterations, 500 ms each, 40000000 calls per op # Timeout: 10 min per iteration # Threads: 32 threads # Benchmark mode: Single shot invocation time # Benchmark: org.openjdk.micros.contended.ContendedMonitorBench.synchronizeAndAccessOwnHeader # Parameters: (inflate = false, monitors = 8, step = 8) # Run progress: 33.33% complete, ETA 00:01:21 # Fork: 1 of 1 # Preparing profilers: perf # Profilers consume stderr from target VM, use -v EXTRA to copy to console Iteration 1: 21311763.233 ±(99.9%) 1557316.704 us/op # Processing profiler results: perf Run result: 21311763.23 us/op (<= 2 samples) Perf stats: -------------------------------------------------- 293788.038146 task-clock # 10.378 CPUs utilized 11,852,161 context-switches # 0.040 M/sec 27,173 CPU-migrations # 0.000 M/sec 14,091 page-faults # 0.000 M/sec 800,160,274,181 cycles # 2.724 GHz [24.99%] 562,454,366,001 stalled-cycles-frontend # 70.29% frontend cycles idle [24.94%] 422,051,711,618 stalled-cycles-backend # 52.75% backend cycles idle [25.11%] 559,232,718,481 instructions # 0.70 insns per cycle # 1.01 stalled cycles per insn [31.43%] 109,187,319,552 branches # 371.653 M/sec [31.47%] 603,734,198 branch-misses # 0.55% of all branches [31.45%] 183,301,942,383 L1-dcache-loads # 623.926 M/sec [31.47%] 4,417,616,365 L1-dcache-load-misses # 2.41% of all L1-dcache hits [31.44%] 48,189,826 LLC-loads # 0.164 M/sec [24.94%] 45,106,878 LLC-load-misses # 93.60% of all LL-cache hits [24.96%] L1-icache-loads 3,532,566,051 L1-icache-load-misses # 0.00% of all L1-icache hits [24.96%] 185,236,407,143 dTLB-loads # 630.510 M/sec [25.00%] 217,630,499 dTLB-load-misses # 0.12% of all dTLB cache hits [25.09%] 318,963,345 iTLB-loads # 1.086 M/sec [25.08%] 73,838,515 iTLB-load-misses # 23.15% of all iTLB cache hits [24.97%] L1-dcache-prefetches 485,647,943 L1-dcache-prefetch-misses # 1.653 M/sec [24.91%] 28.307850049 seconds time elapsed WARNING: Your system uses old "perf", which can not delay data collection. Therefore, perf performance data includes benchmark warmup. # JMH 1.6.3 (released 5 days ago) # VM invoker: /aurora/clredest/contended/jdk-baseline/jre/bin/java # VM options: -XX:-UseBiasedLocking -XX:+UseHeavyMonitors # Warmup: # Measurement: 1 iterations, 500 ms each, 40000000 calls per op # Timeout: 10 min per iteration # Threads: 32 threads # Benchmark mode: Single shot invocation time # Benchmark: org.openjdk.micros.contended.ContendedMonitorBench.synchronizeAndAccessUncontendedHeader # Parameters: (inflate = false, monitors = 8, step = 8) # Run progress: 66.67% complete, ETA 00:00:34 # Fork: 1 of 1 # Preparing profilers: perf # Profilers consume stderr from target VM, use -v EXTRA to copy to console Iteration 1: 12549522.949 ±(99.9%) 1186661.763 us/op # Processing profiler results: perf Run result: 12549522.95 us/op (<= 2 samples) Perf stats: -------------------------------------------------- 245806.906985 task-clock # 15.211 CPUs utilized 6,306,558 context-switches # 0.026 M/sec 11,512 CPU-migrations # 0.000 M/sec 14,159 page-faults # 0.000 M/sec 683,122,421,989 cycles # 2.779 GHz [24.98%] 556,889,261,885 stalled-cycles-frontend # 81.52% frontend cycles idle [24.95%] 464,524,290,293 stalled-cycles-backend # 68.00% backend cycles idle [24.99%] 261,683,864,739 instructions # 0.38 insns per cycle # 2.13 stalled cycles per insn [31.19%] 60,259,405,996 branches # 245.149 M/sec [31.20%] 778,056,052 branch-misses # 1.29% of all branches [31.22%] 83,416,687,681 L1-dcache-loads # 339.359 M/sec [31.30%] 3,265,948,102 L1-dcache-load-misses # 3.92% of all L1-dcache hits [31.45%] 40,050,680 LLC-loads # 0.163 M/sec [25.33%] 38,607,848 LLC-load-misses # 96.40% of all LL-cache hits [25.38%] L1-icache-loads 2,427,778,617 L1-icache-load-misses # 0.00% of all L1-icache hits [25.27%] 84,311,190,001 dTLB-loads # 342.998 M/sec [25.02%] 162,148,542 dTLB-load-misses # 0.19% of all dTLB cache hits [24.94%] 386,956,613 iTLB-loads # 1.574 M/sec [24.92%] 40,675,592 iTLB-load-misses # 10.51% of all iTLB cache hits [24.95%] L1-dcache-prefetches 378,996,666 L1-dcache-prefetch-misses # 1.542 M/sec [25.05%] 16.160005909 seconds time elapsed WARNING: Your system uses old "perf", which can not delay data collection. Therefore, perf performance data includes benchmark warmup. # Run complete. Total time: 00:01:25 Benchmark (inflate) (monitors) (step) Mode Cnt Score Error Units ContendedMonitorBench.synchronizeAndAccessContendedHeader false 8 8 ss 1 34071505.657 ± NaN us/op ContendedMonitorBench.synchronizeAndAccessContendedHeader:@cpi false 8 8 ss 1 1.908 ± NaN CPI ContendedMonitorBench.synchronizeAndAccessOwnHeader false 8 8 ss 1 21311763.233 ± NaN us/op ContendedMonitorBench.synchronizeAndAccessOwnHeader:@cpi false 8 8 ss 1 1.431 ± NaN CPI ContendedMonitorBench.synchronizeAndAccessUncontendedHeader false 8 8 ss 1 12549522.949 ± NaN us/op ContendedMonitorBench.synchronizeAndAccessUncontendedHeader:@cpi false 8 8 ss 1 2.610 ± NaN CPI