
Recreate TabletsMetadata iterator when file ranges are not contiguous #5341

Open
dlmarion wants to merge 5 commits into base: 2.1

Conversation

dlmarion (Contributor)

In the Bulk Import v2 LoadFiles step, a single TabletsMetadata object was used to map a table's tablets to a set of bulk import files. When only a small percentage of tablets were involved in the bulk import, the majority of the table's tablets would still be evaluated. When a bulk import did not target contiguous tablets, the code would simply iterate over the table's tablets until it found the next starting point.

This change recreates the TabletsMetadata object when a set of files does not start at the next tablet in the table. A likely better way to achieve the same result would be to reset the range on the underlying Scanner and create a new iterator, but the TabletsMetadata object does not expose the Scanner. This change also closes the TabletsMetadata objects, which was not being done previously.

Related to #5201
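
A minimal sketch of the approach described above, using the 2.1 TabletsMetadata builder API that appears later in this conversation. The names loadRanges, findOverlappingTablets, fmtTid, and prevLastExtent are illustrative, not the exact code in this PR:

    // Sketch only: rebuild the metadata scan when the next load range does not
    // begin at the tablet immediately after the previous one, and close each
    // TabletsMetadata before replacing it so the underlying scanner is released.
    TabletsMetadata tm = null;
    Iterator<TabletMetadata> tabletIter = null;
    KeyExtent prevLastExtent = null;
    try {
      for (KeyExtent loadRange : loadRanges) {
        if (tm == null || !loadRange.isPreviousExtent(prevLastExtent)) {
          if (tm != null) {
            tm.close();
          }
          tm = TabletsMetadata.builder(manager.getContext()).forTable(tableId)
              .overlapping(loadRange.prevEndRow(), null).checkConsistency()
              .fetch(PREV_ROW, LOCATION, LOADED).build();
          tabletIter = tm.iterator();
        }
        List<TabletMetadata> tablets = findOverlappingTablets(fmtTid, loadRange, tabletIter);
        prevLastExtent = tablets.get(tablets.size() - 1).getExtent();
        // ... map the bulk files for loadRange onto tablets ...
      }
    } finally {
      if (tm != null) {
        tm.close();
      }
    }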

@dlmarion dlmarion added this to the 2.1.4 milestone Feb 19, 2025
@dlmarion dlmarion self-assigned this Feb 19, 2025
@dlmarion dlmarion linked an issue Feb 19, 2025 that may be closed by this pull request
@dlmarion (Contributor Author)

Full IT build passed

@ddanielr (Contributor) left a comment

Added some logging statements in #5345 to see how much time was being spent.

I was running BulkImportSequentialRowsIT directly just to generate some test data, and the log shows that the tablet lookup generally took about 10ms-14ms.

However, with these changes the timings jumped to 64ms-98ms, which doesn't seem great.

Also found that the tablet skipping in findOverlappingTablets didn't seem to be getting called in the existing 2.1 code, but was getting called when these changes were applied.

    // skip tablets until we find the prevEndRow of loadRange
    while ((cmp = PREV_COMP.compare(currTablet.getPrevEndRow(), loadRange.prevEndRow())) < 0) {
      log.trace("{}: Skipping tablet: {}", fmtTid, currTablet.getExtent());
      currTablet = tabletIter.next();
    }

@ddanielr (Contributor) left a comment

A new iterator is created each time when findOverlappingTablets is passed tm.iterator().

This causes the findOverlappingTablets method to attempt to start at the first row of the metadata builder's range and skip ahead to the current range, which adds significant delay.

To solve this we need to also track the iterator and assign a new one if the builder's range changes.

    tm = TabletsMetadata.builder(manager.getContext()).forTable(tableId)
        .overlapping(loadMapKey.prevEndRow(), null).checkConsistency()
        .fetch(PREV_ROW, LOCATION, LOADED).build();
    tabletIter = tm.iterator();

    List<TabletMetadata> tablets = findOverlappingTablets(fmtTid, loadMapKey, tabletIter);

@dlmarion (Contributor Author)

Good info, I can look at this today.

@dlmarion (Contributor Author)

> To solve this we need to also track the iterator and assign a new one if the builder's range changes.

I just pushed this change. There is an issue in PrepBulkImport: it's not closing the TabletsMetadata object. Fixing it isn't straightforward, but I'm going to look at that too.
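
Where the usage pattern allows it, a try-with-resources sketch, assuming TabletsMetadata is AutoCloseable (which the close() calls in this PR imply); startRow and the loop body are illustrative:

    // Sketch only: try-with-resources guarantees the metadata scan is closed
    // even on error paths. Harder to apply in PrepBulkImport if the iterator
    // has to outlive the enclosing scope.
    try (TabletsMetadata tm = TabletsMetadata.builder(manager.getContext()).forTable(tableId)
        .overlapping(startRow, null).checkConsistency().fetch(PREV_ROW).build()) {
      for (TabletMetadata tablet : tm) {
        // ... validate load range boundaries against tablet.getPrevEndRow() ...
      }
    }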

@ddanielr (Contributor) left a comment

Ran the tests again and the new changes are all coming back around 4ms-10ms.

@dlmarion (Contributor Author)

I created #5355 to test tablet boundaries on the load files. I think that should be merged before this PR.

@ddanielr ddanielr mentioned this pull request Feb 25, 2025
    List<TabletMetadata> tablets =
        findOverlappingTablets(fmtTid, loadMapEntry.getKey(), tabletIter);
    KeyExtent loadMapKey = loadMapEntry.getKey();
    if (prevLastExtent != null && !loadMapKey.isPreviousExtent(prevLastExtent)) {
@keith-turner (Contributor) commented on Feb 25, 2025

In some cases this strategy could potentially make performance worse, such as importing into every 3rd tablet. The underlying scanner has already made an RPC and fetched some number of key-values. Not sure of the best way to do this, but ideally we would only reset the scanner if the needed data is not already sitting in the batch of key-values that was already read.
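
One possible heuristic, sketched with an illustrative MAX_SKIP bound: keep consuming the current iterator for a limited number of tablets, on the theory that those key-values may already be buffered client-side, and only rebuild the scan when the gap is larger than that:

    // Sketch only: skip up to MAX_SKIP tablets on the existing iterator before
    // rebuilding the scan at loadRange.prevEndRow(). Assumes load ranges align
    // with tablet boundaries, as the skip loop above does, so the rebuilt scan
    // starts at the needed tablet and no further skipping is required.
    int skipped = 0;
    while (PREV_COMP.compare(currTablet.getPrevEndRow(), loadRange.prevEndRow()) < 0
        && skipped < MAX_SKIP) {
      currTablet = tabletIter.next();
      skipped++;
    }
    if (PREV_COMP.compare(currTablet.getPrevEndRow(), loadRange.prevEndRow()) < 0) {
      tm.close(); // gap too large, reset the metadata scan instead of skipping
      tm = TabletsMetadata.builder(manager.getContext()).forTable(tableId)
          .overlapping(loadRange.prevEndRow(), null).checkConsistency()
          .fetch(PREV_ROW, LOCATION, LOADED).build();
      tabletIter = tm.iterator();
      currTablet = tabletIter.next();
    }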

@keith-turner (Contributor) commented on Feb 25, 2025

Wondering if using a batch scanner would be better here to minimize the overall number of RPCs. It would be a large change to the code. The current code, even if we optimize the use of the Scanner, will make a lot of RPCs in some cases (like importing into every 100th tablet in a million-tablet table), and those RPCs will be made serially. A batch scanner would minimize the number of RPCs in these cases and could also make its RPCs in parallel. The Scanner is probably more efficient for reading tablets sequentially, but the batch scanner is probably not much slower there. I suspect the batch scanner would not be much slower when the data is contiguous and would be much better when it is sparse.

It would be good to gather some performance data before making large changes, to ensure they are needed. We cannot do that in 2.1, but in main we could experiment with SplitMillionIT and try things like importing into every 10th tablet for 1000 tablets, every 100th tablet for 1000 tablets, etc.
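
A rough sketch of the batch scanner idea, going through the public client API since TabletsMetadata does not expose a batch scanner; the client variable, thread count, and loadRanges are illustrative:

    // Sketch only: fetch metadata for all load ranges in one parallel scan.
    // A BatchScanner returns entries unordered, so results would need to be
    // regrouped per load range before mapping files to tablets.
    List<Range> ranges = new ArrayList<>();
    for (KeyExtent loadRange : loadRanges) {
      ranges.add(loadRange.toMetaRange());
    }
    try (BatchScanner bs =
        client.createBatchScanner(MetadataTable.NAME, Authorizations.EMPTY, 8)) {
      bs.setRanges(ranges);
      for (Map.Entry<Key,Value> entry : bs) {
        // ... decode tablet metadata and group by originating load range ...
      }
    }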

@dlmarion (Contributor Author)

Updated with recent changes in 2.1, including the LoadFilesTest, which still passes.

Successfully merging this pull request may close these issues.

Bulk import times scale with the number of tablets in a table.