-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] composite_terms-keyword has increased latency with Lucene 10 index #17388
Comments
Can have the same cause as #17404 where we are expecting the merge policy/segment topology to be a probable cause for regression. |
I can see different number of segment in 2.19/3.0. This might be suspect causing the regression, In big5 run I can see 2.19 version had 18 segments while 3.0 had 22 segments.
After performing force_merge with 10 segments, I can see equal[p50] or improved[P90/P99] latencies.
Will trigger the multiple run and update the results. |
I have triggered multiple runs for big5 workload for both Opensearch 3.0 and Opensearch 2.19 version. Below is the results for the SetupInstance Type: c5.2xlarge Opensearch 3.0
Opensearch 2.19
We can see that query latency of the With We can see results for same version itself has different numbers based on segment count. Based on this results I don't think regression on 3.0 version but the random nature of workload causing this. @mgodwan / @backslasht Please provide your thoughts on this, in case I am missing something. |
Thanks @dhwanilpatel. These are helpful insights. @expani Could you confirm if the tests results you reported in the issue followed force merge before running searches to ensure a consistent benchmark setup for search? |
Thanks for the thorough analysis @dhwanilpatel I had a similar observation that OS 3.0 created more segments than OS 2.19 in the initial run.
I had forced merged OS 3.0 to 19 segments ( same as OS 2.19 had w/o force merge ) and posted the initial numbers. Also, we had used
Luckily, I still had those indices present in my
The numbers below are from the 3rd run ( although OSB warmup takes care of JIT C2 Compiler optimisations but sometimes the number vary so recording the 3rd run where all optimisations should be done ) OS 3.0
OS 2.19.0
OS 2.19 seems to perform better than OS 3.0 OS 3.0 with OS 2.19.0 index is better
When we use OS 2.19.0 index with OS 3.0, it performs better which is the issue I had seen earlier. The setup details are captured at the meta I have also updated the segment count there, thanks for bringing it up. |
We have observed some skweness in segment sizes with force merging to 5/10 segments across both versions. some segments become as hugh as 23gb, which might skew the results of the perf runs. We have triggered the force merge to 1 and compared the results of 2.19/3.0. 3.0 Seems to be performing batter with forced merge to 1 segment compare to 2.19. Below are the result of the runs with one segment for Opensearch 3.0
Opensearch 2.19
cc: @backslasht / @mgodwan / @expani |
Describe the bug
The latency for
composite_terms-keyword
operation in Big5 has increased by 10-15% with the upgrade to Lucene 10. When using the index from OS 2.19 but searching using OS 3.0 server the latency is better. So, we have narrowed down the cause to something wrong with Index format or we misconfiguring/not configuring something with the Lucene 10 upgrade.OS 3.0 * OS 2.19 indicates that the OpenSearch server was running 3.0 whereas the index used was created in OS 2.19. This was done to eliminate any suspicion of the bug arising from an indexing change in Lucene.
Related component
Search:Performance
To Reproduce
Run
composite_terms-keyword
with Big5 workload and compare with OS 2.19Expected behavior
composite_terms-keyword
should perform the same with OS 3.0 index as it's doing with OS 2.19 indexAdditional Details
Meta
The text was updated successfully, but these errors were encountered: