Replies: 4 comments
-
I think it's better to further split this into two issues, but I will leave my comments for both topics. We have something like PipelineMonitoringHook in our docs. It requires some infrastructure and it's not easy to set up by regular users. Pipeline StatisticHow would these pipeline stats be useful?
Smarter way to re-run the pipelineSimilar to the AlternativeCurrently, it requires users to figure out which nodes are not necessary, and do SummaryOne key realization of this change is that |
Beta Was this translation helpful? Give feedback.
-
Related Issue: |
Beta Was this translation helpful? Give feedback.
-
This is a very frequent question actually, will try to collect more evidence for it going forward. There's different things when considering performance, namely (1) execution time, and (2) RAM usage. There are different tools for each of these purposes, so most likely we would need dedicated efforts. I think execution time is probably the most urgent one. This is how I used pyinstrument #3033 (comment) |
Beta Was this translation helpful? Give feedback.
-
Similarly to #4264, what prevents users from using cProfile, line-profiler, PyInstrument, or any other Python tool for performance analysis?
Tracked in #2307, more context in #221 Moving this to Discussions |
Beta Was this translation helpful? Give feedback.
-
Description
Kedro currently doesn't offer any options to analyse the performance of pipelines. Additionally, our users have flagged that they would like to be able to re-run only parts of their pipeline.
Implementation ideas
Questions
Beta Was this translation helpful? Give feedback.
All reactions