Added caching in PyperfNativeStack #36
base: master
Conversation
const uint8_t *NativeStackTrace::stack = NULL;
size_t NativeStackTrace::stack_len = 0;
uintptr_t NativeStackTrace::sp = 0;
uintptr_t NativeStackTrace::ip = 0;
time_t NativeStackTrace::now;
Suggested change:
time_t NativeStackTrace::now;
There are some other formatting issues - you can run ./scripts/git-clang-format to fix them all :)
The last commit contains git-clang-format formatting changes for the files modified in the PR.
return false;
}

UnwindCacheEntry NativeStackTrace::cache_get(const UnwindCache &map, const uint32_t &key) {
Why not return a reference here?
My objective was to limit cache_get to read-only access and to make it more explicit that the content of the cache only changes when cache_put or cache_delete_key are called. The code is then easier to maintain, especially since there is no real need for a reference there.
Do you prefer having a reference here? Could you briefly explain why you think so? Maybe I missed something - I am open to your suggestion and looking forward to your opinion and guidelines.
void NativeStackTrace::cache_put(UnwindCache &mp, const uint32_t &key, const unw_cursor_t cursor, const unw_addr_space_t as, void *upt) {
  // Check available capacity
  if (cache_size() > NativeStackTrace::CacheMaxSizeMB*1024*1024 - cache_single_entry_size()) {
    logInfo(2, "The cache usage is %.2f MB, close to reaching the max memory usage (%d MB)\n", cache_size_KB()/1024, NativeStackTrace::CacheMaxSizeMB);
2 means it's not printed by default - only if we pass -v 2 or higher. Correct? (These should not be printed by default, only when debugging.)
It is not printed by default (longer story below), but it makes sense to use logLevel=3 here to distinguish this info from others that are more of a warning nature.
Long story short, this is related to what I reported regarding my trouble getting/finding logs from PyPerfNativeStack. Increasing verbosity on granulate/gprofiler does not make those bcc/pyperf logs visible in gprofiler stdout. Increasing setVerbosityLevel in PyPerfLoggingHelper.cc does not help either.
I understand no one has reported this problem before, so maybe it's my lack of knowledge of how gprofiler handles logs coming from its modules. So, I decided to check the correctness of those new logs (especially the passed values - what they look like) by dumping them into a temp file. For this purpose, I locally modified void logInfo to open a /tmp/myfile.txt and vfprintf to it. Below I share the code diff.
void logInfo(uint64_t logLevel, const char* fmt, ...) {
+
+  if (logLevel <= 2) {
+    va_list va;
+    va_start(va, fmt);
+    FILE *pFile = fopen("/tmp/gprofiler_tmp/izahelperfile.txt", "a");
+    if (pFile) {
+      std::vfprintf(pFile, fmt, va);
+      fclose(pFile);
+    }
+    va_end(va);  // must pair with va_start even if fopen fails
+  }
[...]
}
It would indeed be an interesting idea to increase verbosity in PyPerf when you increase verbosity in gProfiler - but that's not the case today. That doesn't explain why PyPerf shows no prints, though.
I applied this diff on gProfiler and now I do see log messages in the PyPerf output:
diff --git a/gprofiler/profilers/python_ebpf.py b/gprofiler/profilers/python_ebpf.py
index 9113f355..649f9367 100644
--- a/gprofiler/profilers/python_ebpf.py
+++ b/gprofiler/profilers/python_ebpf.py
@@ -175,6 +175,8 @@ class PythonEbpfProfiler(ProfilerBase):
str(self._EVENTS_BUFFER_PAGES),
"--symbols-map-size",
str(self._SYMBOLS_MAP_SIZE),
+ "-v",
+ "9",
# Duration is irrelevant here, we want to run continuously.
] + self._offset_args()
I'll open a ticket to add it, it's a good feature that'll help debug PyPerf in the future.
Created intel/gprofiler#516
this->symbols.push_back(error.str());
this->error_occurred = true;
goto out;
// Pseudo-proactive way of implementing TTL - whenever any call is made, all expired entries are removed
Together with TTL-based eviction - PyPerf already tracks PIDs exiting, so we can (and should) make use of that; otherwise we risk PID reuse getting us false results.
See populatePidTable, where the "Pruning dead pids" step removes PIDs. We should call cache_delete_key there as well. Please ensure it actually works in removing PIDs that exit.
I pushed a draft of the implementation.
populatePIDTable calls static NativeStackTrace::Pruning_dead_pid(). Please take a look at the proposal and let me know if such an approach is acceptable. It is WIP, so please ignore the debug logs; I will remove them later.
In the meantime, I am debugging why the cache version gives missing and does not manage to add native symbols properly.
It's fine with me. Despite being written in C++, this entire project is fairly functional.
Thanks! I removed debug leftovers.
Force-pushed from d753a85 to 3438223.
@IzabellaRaulin, next time if you need to update from master, you can just merge it instead of rebasing. Rebasing causes GitHub to lose the current review state (I cannot see just the new changes). In any case we use squash-and-merge here, so whether you used merge or rebase to update your branch doesn't matter much eventually :)
Besides the debug prints etc. - please also remove all unrelated formatting changes before we merge (e.g. all blank line additions/removals, ...).
Force-pushed from 3438223 to 762f80d.
Sorry, I didn't realize. But I put the new changes in separate commits, so I hope that will help to track what is new.
It enables a global cache shared by all threads.
Full description and results are available at [Ready-to-review] Adding caching to PyPerfNativeStackTrace intel/gprofiler#421
Before (docker image granulate/gprofiler:1.6.0):

[Outdated] After enabling caching
