memory access
# /opt/intel/oneapi/vtune/latest/bin64/vtune -help collect memory-access
Intel(R) VTune(TM) Profiler Command Line Tool
Copyright (C) 2009-2021 Intel Corporation. All rights reserved.
 Measure a set of metrics to identify memory access related issues (for
 example, specific for NUMA architectures). This analysis type is based on
 the hardware event-based sampling collection.
 To modify the analysis type, use the configuration options (knobs) as
 follows:
 -collect memory-access -knob =
 Multiple -knob options are allowed and can be followed by additional collect
 action options, as well as global options, if needed.
sampling-interval
  Specify an interval (in milliseconds) between CPU samples.
  Default value: 5
  Possible values: numbers between 0.01 and 1000
analyze-mem-objects
  Enable the instrumentation of dynamic memory allocation/de-allocation and
  map hardware events to such memory objects. This option may cause
  additional runtime overhead due to the instrumentation of all system memory
  allocation/de-allocation API.
  Default value: false
  Possible values: true false
mem-object-size-min-thres
  Specify a minimal size of dynamic memory allocations to analyze. This
  option helps reduce runtime overhead of the instrumentation.
  Default value: 1024
  Possible values: numbers between -2147483648 and 2147483647
dram-bandwidth-limits
  Evaluate maximum achievable local DRAM bandwidth before the collection
  starts. This data is used to scale bandwidth metrics on the timeline and
  calculate thresholds.
  Default value: true
  Possible values: true false
analyze-openmp
  Instrument and analyze OpenMP regions to detect inefficiencies such as
  imbalance, lock contention, or overhead on performing scheduling, reduction
  and atomic operations.
  Default value: false
  Possible values: true false
  // example
-knob sampling-interval=5  -knob analyze-mem-objects=true