Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework for creating tools: it is designed to provide a unified interface for recording various performance measurements alongside data logging and interfaces to other tools.
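
To make the "framework for creating tools" idea more concrete, the sketch below bundles independent measurement components behind a single start/stop interface. It is a minimal, standalone illustration that does not include the toolkit's headers: the namespace `sketch` and the names `wall_timer`, `call_counter`, `bundle`, and `count_two_regions` are hypothetical and are not the toolkit's API.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <tuple>
#include <utility>

namespace sketch
{
// Hypothetical component: accumulates elapsed wall-clock time in nanoseconds.
struct wall_timer
{
    using clock_type = std::chrono::steady_clock;

    void start() { m_begin = clock_type::now(); }
    void stop()
    {
        m_total += std::chrono::duration_cast<std::chrono::nanoseconds>(
                       clock_type::now() - m_begin)
                       .count();
    }
    std::int64_t get() const { return m_total; }

private:
    clock_type::time_point m_begin{};
    std::int64_t           m_total = 0;
};

// Hypothetical component: counts completed start/stop pairs.
struct call_counter
{
    void         start() {}
    void         stop() { ++m_count; }
    std::int64_t get() const { return m_count; }

private:
    std::int64_t m_count = 0;
};

// Hypothetical bundle: forwards start()/stop() to every component it holds.
template <typename... Components>
class bundle
{
public:
    void start() { start_impl(std::index_sequence_for<Components...>{}); }
    void stop() { stop_impl(std::index_sequence_for<Components...>{}); }

    // Access one component by type to read its measurement.
    template <typename T>
    const T& get() const
    {
        return std::get<T>(m_data);
    }

private:
    template <std::size_t... Idx>
    void start_impl(std::index_sequence<Idx...>)
    {
        // Braced initialization evaluates the calls left to right.
        int expand[] = { 0, (std::get<Idx>(m_data).start(), 0)... };
        (void) expand;
    }

    template <std::size_t... Idx>
    void stop_impl(std::index_sequence<Idx...>)
    {
        int expand[] = { 0, (std::get<Idx>(m_data).stop(), 0)... };
        (void) expand;
    }

    std::tuple<Components...> m_data{};
};

// Usage: one start/stop pair drives every component in the bundle.
inline std::int64_t count_two_regions()
{
    bundle<wall_timer, call_counter> b;
    for(int i = 0; i < 2; ++i)
    {
        b.start();
        b.stop();
    }
    assert(b.get<wall_timer>().get() >= 0);
    return b.get<call_counter>().get();
}
}  // namespace sketch
```

The point of the design is that adding a new measurement only requires a type with `start()`, `stop()`, and `get()`; the bundle itself does not change.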
cereal:: -> tim::cereal::
)string_view
papi_threading setting
is_invalid in base_state
stack_push
stack_pop
insert
set_depth_change
set_is_flat
set_is_on_stack
set_is_invalid
set_iterator
get_is_flat
get_is_invalid
get_is_on_stack
get_depth
get_storage
get_iterator
timemory-run for (Linux) dynamic instrumentation support
timemory-avail for component/settings/hw-counter availability
timem-mpi for timem + MPI
timemory-python-profiler for python profiling
timemory-python-line-profiler for python line-by-line profiling
papi_vector
data_tracker for tracking values in application
component_bundle, auto_bundle, lightweight_tuple
trait::start_priority<T> and trait::stop_priority<T> use integral_constant instead of true/false

COMPONENT |
---|
caliper |
cpu_clock |
cpu_roofline<Types...> |
cpu_util |
cuda_event |
cuda_profiler |
cupti_activity |
cupti_counters |
data_rss |
gotcha<size_t, Bundle, Diff> |
gperf_cpu_profiler |
gperf_heap_profiler |
gpu_roofline<Types...> |
likwid_nvmon |
likwid_perfmon |
monotonic_clock |
monotonic_raw_clock |
num_io_in |
num_io_out |
num_major_page_faults |
num_minor_page_faults |
num_msg_recv |
num_msg_sent |
num_signals |
num_swap |
nvtx_marker |
page_rss |
papi_array<size_t> |
papi_tuple<int...> |
peak_rss |
priority_context_switch |
process_cpu_clock |
process_cpu_util |
read_bytes |
stack_rss |
system_clock |
tau_marker |
thread_cpu_clock |
thread_cpu_util |
trip_count |
user_bundle<size_t, Tag> |
user_clock |
virtual_memory |
voluntary_context_switch |
vtune_event |
vtune_frame |
wall_clock |
written_bytes |
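
The notes above state that `trait::start_priority<T>` and `trait::stop_priority<T>` use `integral_constant` instead of true/false. As a rough illustration of what an integer-valued priority trait allows that a boolean one does not, the standalone sketch below orders two component types at compile time. Everything in namespace `sketch` is hypothetical: the empty `wall_clock` and `peak_rss` structs only borrow names from the table, the value `-8` is arbitrary, and the rule that a lower value starts first is an assumption rather than documented behavior.

```cpp
#include <cassert>
#include <type_traits>

namespace sketch
{
// Stand-in component types (hypothetical, empty on purpose).
struct wall_clock {};
struct peak_rss {};

// Default priority is zero, carried as a type-level integer instead of a bool.
template <typename T>
struct start_priority : std::integral_constant<int, 0>
{};

// Hypothetical specialization: request that peak_rss is started earlier.
template <>
struct start_priority<peak_rss> : std::integral_constant<int, -8>
{};

// True when A should start before B (assumption: lower value goes first).
template <typename A, typename B>
struct starts_before
: std::integral_constant<bool,
                         (start_priority<A>::value < start_priority<B>::value)>
{};

static_assert(starts_before<peak_rss, wall_clock>::value,
              "peak_rss is ordered ahead of wall_clock");
static_assert(!starts_before<wall_clock, peak_rss>::value,
              "the ordering is not symmetric");

// Runtime spot-check of the same values (the static_asserts already cover this).
inline void check_priorities()
{
    assert(start_priority<wall_clock>::value == 0);
    assert(start_priority<peak_rss>::value < 0);
}
}  // namespace sketch
```

With a true/false trait a component can only opt in or out of being prioritized; an integer lets several components be ranked relative to one another.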