feat: add migration checker cmd #1114

omerfirmak · 2025-02-10T10:35:22Z

1. Purpose or design rationale of this PR

...

2. PR title

Your PR title must follow conventional commits (as we are doing squash merge for each PR), so it must start with one of the following types:

build: Changes that affect the build system or external dependencies (example scopes: yarn, eslint, typescript)
ci: Changes to our CI configuration files and scripts (example scopes: vercel, github, cypress)
docs: Documentation-only changes
feat: A new feature
fix: A bug fix
perf: A code change that improves performance
refactor: A code change that doesn't fix a bug, or add a feature, or improves performance
style: Changes that do not affect the meaning of the code (white-space, formatting, missing semi-colons, etc)
test: Adding missing tests or correcting existing tests

3. Deployment tag versioning

Has the version in params/version.go been updated?

This PR doesn't involve a new deployment, git tag, docker image tag, and it doesn't affect traces
Yes

4. Breaking change label

Does this PR have the breaking-change label?

This PR is not a breaking change
Yes

Summary by CodeRabbit

New Features
- Introduced a method to initialize a SecureTrie instance without tracing.
- Added functionality to count leaf nodes in ZkTrie with optional hash verification and parallel execution support.
- Implemented logging and profiling capabilities for monitoring application performance.
Bug Fixes
- Improved error handling to prevent runtime exceptions during data comparisons and processing.
- Added checks to prevent operations on uninitialized components, enhancing overall robustness.

coderabbitai · 2025-02-10T10:35:31Z

Walkthrough

The changes enhance the functionality of the migration-checker tool and the trie structures. In main.go, logging and profiling capabilities are added, including an HTTP server for monitoring. The secure_trie.go file introduces a method to create a SecureTrie instance without tracing. In tracer.go, nil checks are added to several methods to prevent runtime errors. Lastly, zk_trie.go adds methods for counting leaf nodes, with options for parallel execution and hash verification.

Changes

File(s)	Change Summary
`cmd/migration-checker/main.go`	Added logging and profiling capabilities; introduced HTTP server on `0.0.0.0:6060` for monitoring.
`trie/secure_trie.go`	Added `NewSecureNoTracer` method to create a `SecureTrie` instance without tracing.
`trie/tracer.go`	Introduced nil checks in methods (`onRead`, `onInsert`, `onDelete`, etc.) to prevent runtime panics.
`trie/zk_trie.go`	Added `CountLeaves` and `countLeaves` methods for counting leaf nodes, supporting optional parallel execution and hash verification.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant Main
    participant HTTP_Server
    participant SecureTrie

    User->>Main: Run migration-checker
    Main->>HTTP_Server: Start on 0.0.0.0:6060
    HTTP_Server-->>Main: Log any errors
    Main->>SecureTrie: Create SecureTrie instance
    SecureTrie-->>Main: Return instance

Poem

I'm a hopping rabbit with code in my soul,
Leaping through changes with a concurrent goal,
Logging and profiling, oh what a delight,
Counting leaves swiftly, from morning to night,
With each new method, I dance and I cheer,
Celebrating code that's robust and clear!
🐰 Happy coding with each new idea so dear!

📜 Recent review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d623b3f and 0eb1fb4.

📒 Files selected for processing (7)

cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (8 hunks)
cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (3 hunks)
cmd/migration-checker/main.go (4 hunks)
cmd/migration-checker/main.go (2 hunks)

🚧 Files skipped from review as they are similar to previous changes (5)

cmd/migration-checker/main.go
cmd/migration-checker/main.go
cmd/migration-checker/main.go
cmd/migration-checker/main.go
cmd/migration-checker/main.go

⏰ Context from checks skipped due to timeout of 90000ms (1)

GitHub Check: test

🔇 Additional comments (10)

cmd/migration-checker/main.go (10)

23-23: Channel declaration changed to be more configurable.

The declaration of trieCheckers has been changed from a buffered channel with fixed size to an uninitialized channel variable. The actual initialization happens later with a dynamic buffer size based on the parallelism multiplier. This is a good improvement for configurability.

32-37: Great enhancement to code configurability with parallelism-multiplier flag.

Adding the parallelism-multiplier flag allows users to control the level of concurrency in the migration checker, which can be adjusted based on available system resources. This makes the tool more flexible and adaptable to different environments.

49-51: Good implementation of dynamic concurrency scaling.

The code now calculates the number of checkers based on the system's available processors and the user-provided multiplier, which is a better approach than the previous hardcoded value.

74-74: Ensuring all checkers are properly consumed.

This line correctly uses the dynamic numTrieCheckers variable to ensure all spawned goroutines are properly cleaned up before program exit.

87-91: Consider returning errors instead of panicking.

Using panic for error handling is not a best practice. Consider returning errors and letting the caller decide how to handle them.
-func panicOnError(err error, label, msg string) {
+func handleError(err error, label, msg string) error {
 	if err != nil {
-		panic(fmt.Sprint(label, " error: ", msg, " ", err))
+		return fmt.Errorf("%s error: %s %w", label, msg, err)
 	}
+	return nil
 }
260-288: Fix potential race condition in ZK trie loading.

The lock handling in the loadZkTrie function could be improved to prevent potential race conditions by using defer to ensure the lock is always released.
 func loadZkTrie(zkTrie *trie.ZkTrie, parallel, paranoid bool) chan map[string][]byte {
     zkLeafMap := make(map[string][]byte, 1000)
     var zkLeafMutex sync.Mutex
     zkDone := make(chan map[string][]byte)
     go func() {
         zkTrie.CountLeaves(func(key, value []byte) {
             preimageKey := zkTrie.GetKey(key)
             if len(preimageKey) == 0 {
                 panic(fmt.Sprintf("preimage not found zk trie %s", hex.EncodeToString(key)))
             }
 
             if parallel {
                 zkLeafMutex.Lock()
+                defer zkLeafMutex.Unlock()
             }
 
             zkLeafMap[string(dup(preimageKey))] = value
 
-            if parallel {
-                zkLeafMutex.Unlock()
-            }
 
             if parallel && len(zkLeafMap)%10000 == 0 {
                 fmt.Println("ZK Accounts Loaded:", len(zkLeafMap))
             }
         }, parallel, paranoid)
         zkDone <- zkLeafMap
     }()
     return zkDone
 }
210-258: Fix potential race condition in break condition.

The break condition after checking for duplicate keys has a race condition. The mutex is unlocked before breaking, which could lead to inconsistent state.
 func loadMPT(mptTrie *trie.SecureTrie, parallel bool) chan map[string][]byte {
     // ... rest of the code ...

     for i := 0; i < workers; i++ {
         mptWg.Add(1)
         go func() {
             defer mptWg.Done()
             for trieIt.Next() {
                 if parallel {
                     mptLeafMutex.Lock()
                 }

+                var shouldBreak bool
                 if _, ok := mptLeafMap[string(trieIt.Key)]; ok {
-                    mptLeafMutex.Unlock()
-                    break
+                    shouldBreak = true
                 }

-                mptLeafMap[string(dup(trieIt.Key))] = dup(trieIt.Value)
+                if !shouldBreak {
+                    mptLeafMap[string(dup(trieIt.Key))] = dup(trieIt.Value)
+                }

                 if parallel {
                     mptLeafMutex.Unlock()
                 }

+                if shouldBreak {
+                    break
+                }

                 if parallel && len(mptLeafMap)%10000 == 0 {
                     fmt.Println("MPT Accounts Loaded:", len(mptLeafMap))
                 }
             }
         }()
     }

     // ... rest of the code ...
 }
32-37: Add validation for required command-line flags.

The code should validate that required flags are provided before proceeding.
 func main() {
     var (
         mptDbPath = flag.String("mpt-db", "", "path to the MPT node DB")
         zkDbPath  = flag.String("zk-db", "", "path to the ZK node DB")
         mptRoot   = flag.String("mpt-root", "", "root hash of the MPT node")
         zkRoot    = flag.String("zk-root", "", "root hash of the ZK node")
         paranoid  = flag.Bool("paranoid", false, "verifies all node contents against their expected hash")
         parallelismMultipler = flag.Int("parallelism-multiplier", 4, "multiplier for the number of parallel workers")
     )
     flag.Parse()
+
+    if *mptDbPath == "" || *zkDbPath == "" || *mptRoot == "" || *zkRoot == "" {
+        flag.Usage()
+        os.Exit(1)
+    }
15-17:
✅ Verification successful

Add HTTP server for profiling.

The addition of net/http and pprof imports suggests enabling profiling capabilities, which is good for debugging and performance analysis. However, there's no actual HTTP server being started in the visible code.

🏁 Script executed:
#!/bin/bash
# Check if an HTTP server is started somewhere in the file
rg "http.ListenAndServe" cmd/migration-checker/main.go
Length of output: 110
HTTP Profiling Server Confirmed as Running

The call to http.ListenAndServe("0.0.0.0:6060", nil) in the code confirms that the HTTP server for profiling is indeed active. No further changes are needed.

63-72:
✅ Verification successful

Misplaced code block with incomplete context.

This block appears to be misplaced and lacks proper context. It contains a goroutine for monitoring progress but doesn't properly integrate with the surrounding code.

🏁 Script executed:
#!/bin/bash
# Check if this code block exists elsewhere in the codebase in a proper context
rg -A 10 "done := make$chan struct{}$" --glob "*.go"
Length of output: 194

Below is an updated shell script using fixed-string search to re-verify if the snippet exists in a proper context. Please execute it and share the output so we can determine if the code appears elsewhere.

Once you provide the output of this command, we can finalize the review comment.

🏁 Script executed:
#!/bin/bash
# Re-run the search using fixed-string mode (-F) to avoid regex parsing issues.
rg -F -A 10 'done := make(chan struct{})' --glob '*.go'
Length of output: 8602

Below is an updated shell script to search for any usage of the done channel (e.g. calls to close(done)) within the file to verify if the progress-monitoring goroutines are properly terminated.

Please run this script and share the output so we can finalize whether the monitoring code is integrated correctly.

🏁 Script executed:
#!/bin/bash
# Search for usage of `close(done)` in cmd/migration-checker/main.go to determine if the monitoring goroutines are terminated properly.
rg "close$done$" cmd/migration-checker/main.go
Length of output: 210
Verified: Monitoring Goroutine Integration Confirmed

After verifying the code in cmd/migration-checker/main.go, we can confirm that the progress-monitoring goroutine is properly integrated into the surrounding logic. The use of a done channel with corresponding deferred calls to close(done) ensures that the goroutine is correctly terminated. The logging of "Active checkers:" is appropriately encapsulated within this context, and similar patterns in the codebase further confirm this integration.

No changes are required for this snippet.

✨ Finishing Touches

📝 Generate Docstrings

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (7)

cmd/migration-checker/main.go (7)
21-22: Consider grouping related global variables into a config struct.
Storing global variables like accountsDone and trieCheckers is convenient, but it can introduce shared state across the entire application. Wrapping them in a configuration or context structure may improve clarity, testability, and modularity.

29-58: Evaluate replacing panics with error returns or exit codes.
While using panicOnError is acceptable in a development tool, panics might complicate automated scripts or higher-level orchestration. Converting these to user-facing error messages with proper exit codes could yield a cleaner CLI experience.

46-49: Check concurrency capacity alignment with real-world usage.
Using runtime.GOMAXPROCS(0)*4 for channel capacity is a good start, but it may need tuning for very large databases. Monitoring resource utilization and adjusting this factor might improve throughput or prevent overconsumption of system resources.

81-83: Add clarity before panicking on leaf count mismatch.
While it’s correct to panic if leaf counts differ, consider logging the specific roots or DB paths for easier troubleshooting.

105-107: Remove unnecessary byte slice conversion.
Static analysis indicates an unnecessary conversion to []byte(...) for mptKey and preimageKey, which are already []byte.
- panic(fmt.Sprintf("%s key %s (preimage %s) not found in mpt", label, hex.EncodeToString([]byte(mptKey)), hex.EncodeToString([]byte(preimageKey))))
+ panic(fmt.Sprintf("%s key %s (preimage %s) not found in mpt", label, hex.EncodeToString(mptKey), hex.EncodeToString(preimageKey)))
🧰 Tools

🪛 GitHub Check: check

[failure] 107-107:
unnecessary conversion (unconvert)

134-146: Handle goroutine panics more gracefully.
Capturing panics in the goroutine and then calling os.Exit(1) is valid, but consider using a more structured error-reporting approach. This might help gather partial results or trigger retries.

217-245: Ensure consistent error handling in loadZkTrie.
panic is used when a preimage is missing. If missing preimages are a frequent partial data scenario, consider logging or returning an error to allow partial inspections.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b56dd9f and a87c094.

📒 Files selected for processing (4)

cmd/migration-checker/main.go (1 hunks)
trie/secure_trie.go (1 hunks)
trie/tracer.go (4 hunks)
trie/zk_trie.go (1 hunks)

🧰 Additional context used

🪛 GitHub Check: check

cmd/migration-checker/main.go

[failure] 107-107:
unnecessary conversion (unconvert)

⏰ Context from checks skipped due to timeout of 90000ms (1)

GitHub Check: test

🔇 Additional comments (4)

cmd/migration-checker/main.go (2)

69-112: Verify potential mismatch in leaf counts.
The code darts right into a panic if there is a mismatch between MPT leaves and ZK leaves. This strict approach is valid for a verification tool, but confirm whether partial or incremental mismatch logs might be more appropriate for diagnosing large data sets.

🧰 Tools

🪛 GitHub Check: check

[failure] 107-107:
unnecessary conversion (unconvert)

167-215: Watch for race conditions on the shared map.
This code uses a mutex for the shared mptLeafMap. However, the brief locking around if _, ok := mptLeafMap[...] and mptLeafMap[...] = ... might leave a small gap if multiple goroutines detect the same key concurrently before any break/return. Confirm whether this is acceptable or if you require a more robust check-and-insert process.

trie/tracer.go (1)

63-66: Nil receiver checks look correct.
These added if t == nil { return } conditions prevent panics when tracer is intentionally set to nil (e.g., via NewSecureNoTracer). This change maintains robust handling if the tracer is disabled.

Also applies to: 74-77, 89-92, 102-105, 114-116, 130-133

trie/secure_trie.go (1)

68-76: NewSecureNoTracer function is well-structured.
The approach of calling NewSecure and then setting trie.tracer = nil is straightforward and aligns with the new nil-check logic in tracer.go. Consider documenting any limitations when tracing is disabled (e.g., no record of deleted nodes).

trie/zk_trie.go

coderabbitai

Actionable comments posted: 3

♻️ Duplicate comments (1)

trie/zk_trie.go (1)
242-248: 🛠️ Refactor suggestion

Improve error handling and add documentation.

The method needs better error handling and documentation.

Additionally, fix the pipeline failures by adding the missing verifyNodeHashes argument:
-func (t *ZkTrie) CountLeaves(cb func(key, value []byte), parallel, verifyNodeHashes bool) uint64 {
+func (t *ZkTrie) CountLeaves(cb func(key, value []byte), parallel, verifyNodeHashes bool) (uint64, error) {
 	root, err := t.ZkTrie.Tree().Root()
 	if err != nil {
-		panic("CountLeaves cannot get root")
+		return 0, fmt.Errorf("failed to get root: %w", err)
 	}
-	return t.countLeaves(root, cb, 0, parallel, verifyNodeHashes)
+	return t.countLeaves(root, cb, 0, parallel, verifyNodeHashes), nil
 }

🧹 Nitpick comments (1)

cmd/migration-checker/main.go (1)

39-42: Consider making levelDB buffer sizes configurable.

The hardcoded buffer sizes (1024, 128) for levelDB might not be optimal for all scenarios. Consider making these configurable via command-line flags.

 var (
     mptDbPath = flag.String("mpt-db", "", "path to the MPT node DB")
     zkDbPath  = flag.String("zk-db", "", "path to the ZK node DB")
     mptRoot   = flag.String("mpt-root", "", "root hash of the MPT node")
     zkRoot    = flag.String("zk-root", "", "root hash of the ZK node")
     paranoid  = flag.Bool("paranoid", false, "verifies all node contents against their expected hash")
+    cacheSize = flag.Int("cache-size", 1024, "size of the data cache in MB")
+    handles   = flag.Int("handles", 128, "number of file handles")
 )

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a87c094 and 81ab44f.

📒 Files selected for processing (2)

cmd/migration-checker/main.go (1 hunks)
trie/zk_trie.go (1 hunks)

🧰 Additional context used

🪛 GitHub Check: test

trie/zk_trie.go

[failure] 279-279:
not enough arguments in call to leftT.countLeaves

[failure] 282-282:
not enough arguments in call to rightT.countLeaves

[failure] 286-286:
not enough arguments in call to t.countLeaves

🪛 GitHub Check: check

trie/zk_trie.go

[failure] 279-279:
not enough arguments in call to leftT.countLeaves

[failure] 282-282:
not enough arguments in call to rightT.countLeaves

[failure] 286-286:
not enough arguments in call to t.countLeaves

[failure] 279-279:
not enough arguments in call to leftT.countLeaves

[failure] 282-282:
not enough arguments in call to rightT.countLeaves

[failure] 286-286:
not enough arguments in call to t.countLeaves

[failure] 279-279:
not enough arguments in call to leftT.countLeaves

cmd/migration-checker/main.go

[failure] 18-18:
could not import github.com/scroll-tech/go-ethereum/trie (-: # github.com/scroll-tech/go-ethereum/trie

🪛 GitHub Check: build-mock-ccc-geth

trie/zk_trie.go

[failure] 279-279:
not enough arguments in call to leftT.countLeaves

[failure] 282-282:
not enough arguments in call to rightT.countLeaves

[failure] 286-286:
not enough arguments in call to t.countLeaves

🪛 GitHub Actions: CI

trie/zk_trie.go

[error] 279-279: not enough arguments in call to leftT.countLeaves

🪛 golangci-lint (1.62.2)

cmd/migration-checker/main.go

18-18: could not import github.com/scroll-tech/go-ethereum/trie (-: # github.com/scroll-tech/go-ethereum/trie
trie/zk_trie.go:279:62: not enough arguments in call to leftT.countLeaves
have (*zktrie.Hash, func(key []byte, value []byte), int, bool)
want (*zktrie.Hash, func(key []byte, value []byte), int, bool, bool)
trie/zk_trie.go:282:63: not enough arguments in call to rightT.countLeaves
have (*zktrie.Hash, func(key []byte, value []byte), int, bool)
want (*zktrie.Hash, func(key []byte, value []byte), int, bool, bool)
trie/zk_trie.go:286:55: not enough arguments in call to t.countLeaves
have (*zktrie.Hash, func(key []byte, value []byte), int, bool)
want (*zktrie.Hash, func(key []byte, value []byte), int, bool, bool))

(typecheck)

cmd/migration-checker/main.go

trie/zk_trie.go

coderabbitai

Actionable comments posted: 3

♻️ Duplicate comments (4)

cmd/migration-checker/main.go (2)

61-65: 🛠️ Refactor suggestion

Consider returning errors instead of panicking.

Using panic for error handling is not a best practice. Consider returning errors and letting the caller decide how to handle them.

70-113: 🛠️ Refactor suggestion

Refactor for better modularity and readability.

The function is doing too many things: creating tries, loading data, comparing data, and handling preimage keys. Consider splitting it into smaller, more focused functions.

🧰 Tools

🪛 GitHub Check: check

[failure] 108-108:
unnecessary conversion (unconvert)

trie/zk_trie.go (2)

242-248: 🛠️ Refactor suggestion

Improve error handling and add documentation.

The method needs the following improvements:

Return an error instead of panicking

Add nil check for the callback function

Add documentation explaining the purpose and parameters

250-289: ⚠️ Potential issue

Fix missing arguments and improve error handling.

The method has several issues:

Missing verifyNodeHashes argument in recursive calls

Using panic for error handling

No error handling in goroutines

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 81ab44f and 12ec3a6.

📒 Files selected for processing (4)

cmd/migration-checker/main.go (1 hunks)
trie/secure_trie.go (1 hunks)
trie/tracer.go (4 hunks)
trie/zk_trie.go (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (2)

trie/tracer.go
trie/secure_trie.go

🧰 Additional context used

🪛 GitHub Check: check

cmd/migration-checker/main.go

[failure] 108-108:
unnecessary conversion (unconvert)

⏰ Context from checks skipped due to timeout of 90000ms (1)

GitHub Check: test

🔇 Additional comments (3)

cmd/migration-checker/main.go (3)

67-69: LGTM!

The function correctly creates a copy of the input byte slice.

158-166: LGTM!

The function correctly compares storage values between ZK and MPT tries.

218-246: LGTM!

The function correctly loads ZK trie data with proper synchronization for parallel processing.

cmd/migration-checker/main.go

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (2)

cmd/migration-checker/main.go (2)

21-22: Consider encapsulating global variables within a struct.

Global variables make testing difficult and can introduce subtle bugs. Consider encapsulating these within a struct:

-var accountsDone atomic.Uint64
-var trieCheckers = make(chan struct{}, runtime.GOMAXPROCS(0)*4)

+type migrationChecker struct {
+    accountsDone atomic.Uint64
+    trieCheckers chan struct{}
+}
+
+func newMigrationChecker() *migrationChecker {
+    maxWorkers := runtime.GOMAXPROCS(0) * 4
+    if maxWorkers <= 0 {
+        maxWorkers = 1
+    }
+    return &migrationChecker{
+        trieCheckers: make(chan struct{}, maxWorkers),
+    }
+}

70-113: Optimize memory usage in trie comparison.

The function loads all leaves into memory before comparison. For large tries, this could cause out-of-memory issues.

Consider implementing a streaming comparison that processes leaves in batches:

 func checkTrieEquality(...) {
+    const batchSize = 1000
     // ... existing setup code ...
-    mptLeafMap := <-mptLeafCh
-    zkLeafMap := <-zkLeafCh
+    for {
+        mptBatch := make(map[string][]byte, batchSize)
+        zkBatch := make(map[string][]byte, batchSize)
+        // ... load and compare batches ...
+        if done {
+            break
+        }
+    }

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 12ec3a6 and 642c35f.

📒 Files selected for processing (1)

cmd/migration-checker/main.go (1 hunks)

⏰ Context from checks skipped due to timeout of 90000ms (1)

GitHub Check: test

🔇 Additional comments (1)

cmd/migration-checker/main.go (1)

61-65: Replace panic with error returns for better error handling.

Using panic for error handling is not a best practice.

See previous suggestion about returning errors instead of panicking.

cmd/migration-checker/main.go

coderabbitai

Actionable comments posted: 7

♻️ Duplicate comments (2)

cmd/migration-checker/main.go (2)
110-110: ⚠️ Potential issue

Update function signature to include paranoid parameter.

The function signature for checkAccountEquality needs to be updated to include the paranoid parameter, which is passed from checkTrieEquality.
-func checkAccountEquality(label string, dbs *dbs, zkAccountBytes, mptAccountBytes []byte) {
+func checkAccountEquality(label string, dbs *dbs, zkAccountBytes, mptAccountBytes []byte, paranoid bool) {
35-35: ⚠️ Potential issue

Add paranoid mode flag.

Looking at the code, there's a paranoid parameter in the updated loadZkTrie function (line 249-277), but it's not defined in the command-line flags.
 var (
     mptDbPath = flag.String("mpt-db", "", "path to the MPT node DB")
     zkDbPath  = flag.String("zk-db", "", "path to the ZK node DB")
     mptRoot   = flag.String("mpt-root", "", "root hash of the MPT node")
     zkRoot    = flag.String("zk-root", "", "root hash of the ZK node")
+    paranoid  = flag.Bool("paranoid", false, "verifies all node contents against their expected hash")
 )

🧹 Nitpick comments (1)

cmd/migration-checker/main.go (1)

1-241: Add elapsed time tracking for performance monitoring.

The code would benefit from elapsed time tracking to help monitor the performance of the migration checking process.

 import (
     // ... existing imports
+    "time"
 )

 func main() {
+    startTime := time.Now()
     // ... existing code
     
     checkTrieEquality(&dbs{
         zkDb:  zkDb,
         mptDb: mptDb,
     }, zkRootHash, mptRootHash, "", checkAccountEquality, true, *paranoid)
+    
+    elapsed := time.Since(startTime)
+    fmt.Printf("Migration check completed in %s\n", elapsed)
 }

 func checkTrieEquality(dbs *dbs, zkRoot, mptRoot common.Hash, label string, leafChecker func(string, *dbs, []byte, []byte, bool), top bool, paranoid bool) {
+    checkStartTime := time.Now()
     // ... existing code
     
+    if top {
+        elapsed := time.Since(checkStartTime)
+        fmt.Printf("Trie equality check completed in %s\n", elapsed)
+    }
 }

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 642c35f and 8a7e8ca.

📒 Files selected for processing (5)

cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (8 hunks)
cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (3 hunks)

🚧 Files skipped from review as they are similar to previous changes (3)

cmd/migration-checker/main.go
cmd/migration-checker/main.go
cmd/migration-checker/main.go

⏰ Context from checks skipped due to timeout of 90000ms (1)

GitHub Check: test

🔇 Additional comments (6)

cmd/migration-checker/main.go (6)

29-54: Add validation for required command line flags.

The current implementation doesn't validate that required flags are provided before proceeding with execution. Without validation, the program might panic with cryptic errors when trying to use empty values.

 func main() {
     var (
         mptDbPath = flag.String("mpt-db", "", "path to the MPT node DB")
         zkDbPath  = flag.String("zk-db", "", "path to the ZK node DB")
         mptRoot   = flag.String("mpt-root", "", "root hash of the MPT node")
         zkRoot    = flag.String("zk-root", "", "root hash of the ZK node")
     )
     flag.Parse()
+    
+    if *mptDbPath == "" || *zkDbPath == "" || *mptRoot == "" || *zkRoot == "" {
+        fmt.Println("Error: All flags (mpt-db, zk-db, mpt-root, zk-root) must be provided")
+        flag.Usage()
+        os.Exit(1)
+    }

56-60: Consider returning errors instead of panicking.

Using panic for error handling is not a best practice. Consider returning errors and letting the caller decide how to handle them.

-func panicOnError(err error, label, msg string) {
+func handleError(err error, label, msg string) error {
 	if err != nil {
-		panic(fmt.Sprint(label, " error: ", msg, " ", err))
+		return fmt.Errorf("%s error: %s %w", label, msg, err)
 	}
+	return nil
 }

134-146: Improve error handling in goroutines.

The goroutine error handling could be improved by using a channel to propagate errors from goroutines.

 <-trieCheckers
+errCh := make(chan error, 1)
 go func() {
     defer func() {
         if p := recover(); p != nil {
-            fmt.Println(p)
-            os.Exit(1)
+            if err, ok := p.(error); ok {
+                errCh <- fmt.Errorf("%s: %w", label, err)
+            } else {
+                errCh <- fmt.Errorf("%s: %v", label, p)
+            }
+            return
         }
+        errCh <- nil
     }()

     checkTrieEquality(dbs, zkRoot, mptRoot, label, checkStorageEquality, false, paranoid)
     accountsDone.Add(1)
     fmt.Println("Accounts done:", accountsDone.Load())
     trieCheckers <- struct{}{}
 }()
+if err := <-errCh; err != nil {
+    panic(err)
+}

187-190: Fix potential race condition in break condition.

The break condition after checking for duplicate keys has a race condition. The mutex is unlocked before breaking, which could lead to inconsistent state.

+var shouldBreak bool
 if _, ok := mptLeafMap[string(trieIt.Key)]; ok {
-    mptLeafMutex.Unlock()
-    break
+    shouldBreak = true
 }

-mptLeafMap[string(dup(trieIt.Key))] = dup(trieIt.Value)
+if !shouldBreak {
+    mptLeafMap[string(dup(trieIt.Key))] = dup(trieIt.Value)
+}

 if parallel {
     mptLeafMutex.Unlock()
 }

+if shouldBreak {
+    break
+}

224-232: Fix potential race condition in ZK trie loading.

Similar to the MPT loading function, there's a potential race condition when checking for duplicates in the ZK trie loading function.

 if parallel {
     zkLeafMutex.Lock()
 }
+defer func() {
+    if parallel {
+        zkLeafMutex.Unlock()
+    }
+}()

 zkLeafMap[string(dup(preimageKey))] = value
-
-if parallel {
-    zkLeafMutex.Unlock()
-}

65-108: Refactor for better modularity and readability.

The function is doing too many things: creating tries, loading data, comparing data, and handling preimage keys. Consider splitting it into smaller, more focused functions.

A good refactoring would include:

Extract preimage key handling into a separate function
Extract trie creation into a separate function
Extract comparison logic into a separate function

Example refactor for preimage key handling:

+func normalizePreimageKey(preimageKey string, top bool) string {
+    if top {
+        if len(preimageKey) > 20 {
+            for _, b := range []byte(preimageKey)[20:] {
+                if b != 0 {
+                    panic(fmt.Sprintf("padded byte is not 0 (preimage %s)", 
+                        hex.EncodeToString([]byte(preimageKey))))
+                }
+            }
+            return preimageKey[:20]
+        }
+    } else if len(preimageKey) != 32 {
+        zeroes := make([]byte, 32)
+        copy(zeroes, []byte(preimageKey))
+        return string(zeroes)
+    }
+    return preimageKey
+}

coderabbitai · 2025-03-11T10:39:13Z

cmd/migration-checker/main.go

+		trieIt := trie.NewIterator(mptTrie.NodeIterator(startKey))
+
+		mptWg.Add(1)
+		go func() {
+			defer mptWg.Done()
+			for trieIt.Next() {
+				if parallel {
+					mptLeafMutex.Lock()
+				}
+
+				if _, ok := mptLeafMap[string(trieIt.Key)]; ok {
+					mptLeafMutex.Unlock()
+					break
+				}
+
+				mptLeafMap[string(dup(trieIt.Key))] = dup(trieIt.Value)
+
+				if parallel {
+					mptLeafMutex.Unlock()
+				}
+
+				if parallel && len(mptLeafMap)%10000 == 0 {
+					fmt.Println("MPT Accounts Loaded:", len(mptLeafMap))
+				}
+			}
+		}()
+	}
+
+	respChan := make(chan map[string][]byte)


⚠️ Potential issue

Resolve conflict in loadZkTrie function definitions.

There appears to be two definitions of the loadZkTrie function. The first version (lines 213-241) doesn't have a paranoid parameter, while the second version (lines 249-277) does. You need to resolve this conflict and keep only the second version.

Remove lines 213-241 and keep the version that includes the paranoid parameter (lines 249-277).

Also applies to: 242-277

cmd/migration-checker/main.go

coderabbitai · 2025-03-11T10:39:13Z

cmd/migration-checker/main.go

+		}
+	}()
+	defer close(done)
+


⚠️ Potential issue

Update function signature to include paranoid parameter.

The function signature for checkTrieEquality needs to be updated to include the paranoid parameter, which is used in the loadZkTrie call.

-func checkTrieEquality(dbs *dbs, zkRoot, mptRoot common.Hash, label string, leafChecker func(string, *dbs, []byte, []byte), top bool) { +func checkTrieEquality(dbs *dbs, zkRoot, mptRoot common.Hash, label string, leafChecker func(string, *dbs, []byte, []byte, bool), top bool, paranoid bool) {

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

func checkTrieEquality(dbs *dbs, zkRoot, mptRoot common.Hash, label string, leafChecker func(string, *dbs, []byte, []byte, bool), top bool, paranoid bool) {

// existing function implementation

}

cmd/migration-checker/main.go

coderabbitai · 2025-03-11T10:39:13Z

cmd/migration-checker/main.go

+	panicOnError(err, label, "failed to decode zk account")
+
+	if mptAccount.Nonce != zkAccount.Nonce {
+		panic(fmt.Sprintf("%s nonce mismatch: zk: %d, mpt: %d", label, zkAccount.Nonce, mptAccount.Nonce))


⚠️ Potential issue

Update function signature to include paranoid parameter.

The function signature for checkStorageEquality needs to be updated to include the paranoid parameter for consistency.

-func checkStorageEquality(label string, _ *dbs, zkStorageBytes, mptStorageBytes []byte) { +func checkStorageEquality(label string, _ *dbs, zkStorageBytes, mptStorageBytes []byte, paranoid bool) {

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

panic(fmt.Sprintf("%s nonce mismatch: zk: %d, mpt: %d", label, zkAccount.Nonce, mptAccount.Nonce))

func checkStorageEquality(label string, _ *dbs, zkStorageBytes, mptStorageBytes []byte, paranoid bool) {

panic(fmt.Sprintf("%s nonce mismatch: zk: %d, mpt: %d", label, zkAccount.Nonce, mptAccount.Nonce))

}

coderabbitai · 2025-03-11T10:39:13Z

cmd/migration-checker/main.go

+			panic(fmt.Sprintf("%s key %s (preimage %s) not found in mpt", label, hex.EncodeToString(mptKey), hex.EncodeToString([]byte(preimageKey))))
+		}
+
+		leafChecker(fmt.Sprintf("%s key: %s", label, hex.EncodeToString([]byte(preimageKey))), dbs, zkValue, mptVal, paranoid)


⚠️ Potential issue

Update function call to pass paranoid parameter.

The function call to checkTrieEquality within the goroutine needs to include the paranoid parameter.

- checkTrieEquality(dbs, zkRoot, mptRoot, label, checkStorageEquality, false) + checkTrieEquality(dbs, zkRoot, mptRoot, label, checkStorageEquality, false, paranoid)

Committable suggestion skipped: line range outside the PR's diff.

coderabbitai · 2025-03-11T10:39:13Z

cmd/migration-checker/main.go

+	for i := 0; i < runtime.GOMAXPROCS(0)*4; i++ {
+		<-trieCheckers


⚠️ Potential issue

Update function calls to pass paranoid parameter.

The calls to loadMPT and loadZkTrie need to be updated to match their new function signatures and pass the paranoid parameter.

- mptLeafCh := loadMPT(mptTrie, top) - zkLeafCh := loadZkTrie(zkTrie, top) + mptLeafCh := loadMPT(mptTrie, top) + zkLeafCh := loadZkTrie(zkTrie, top, paranoid)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

for i := 0; i < runtime.GOMAXPROCS(0)*4; i++ {

<-trieCheckers

mptLeafCh := loadMPT(mptTrie, top)

zkLeafCh := loadZkTrie(zkTrie, top, paranoid)

coderabbitai

Actionable comments posted: 0

🔭 Outside diff range comments (1)

cmd/migration-checker/main.go (1)
242-280: ⚠️ Potential issue

Unresolved merge conflict and duplicate code.

Lines 242–280 include another partial/unwanted definition or leftover code. This will not compile as-is. Remove or merge the conflicting definitions of loadZkTrie properly.
-}
-	}
-
-respChan := make(chan map[string][]byte)
-go func() {
-	mptWg.Wait()
-	respChan <- mptLeafMap
-}()
-return respChan
-}
-}
-
-func loadZkTrie(zkTrie *trie.ZkTrie, parallel, paranoid bool) chan map[string][]byte {
+func loadZkTrie(zkTrie *trie.ZkTrie, parallel, paranoid bool) chan map[string][]byte {
   // keep only the most up-to-date definition here
   // ...
 }

🧹 Nitpick comments (1)

cmd/migration-checker/main.go (1)

65-108: Consider splitting the function for modularity.

checkTrieEquality is doing multiple tasks—creating tries, loading leaves, normalizing preimages, and verifying matching leaves. Splitting into smaller functions would improve readability and testability.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 8a7e8ca and d623b3f.

📒 Files selected for processing (6)

cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (8 hunks)
cmd/migration-checker/main.go (1 hunks)
cmd/migration-checker/main.go (3 hunks)
cmd/migration-checker/main.go (4 hunks)

🚧 Files skipped from review as they are similar to previous changes (3)

cmd/migration-checker/main.go
cmd/migration-checker/main.go
cmd/migration-checker/main.go

⏰ Context from checks skipped due to timeout of 90000ms (1)

GitHub Check: test

🔇 Additional comments (6)

cmd/migration-checker/main.go (6)

29-36: Validate required command-line flags.

Currently, there's no validation for mandatory flags (e.g., --mpt-db, --zk-db, --mpt-root, --zk-root). Exiting or showing usage instructions when these flags are not provided would improve user experience and avoid potential runtime failures.

56-60: Consider returning errors instead of panicking.

Using panicOnError for error handling can abruptly terminate the process and limit composability. Allowing callers to handle errors would improve robustness.

110-151: Improve error handling in goroutines.

Calling os.Exit(1) on panic inside the goroutine stops the entire process prematurely. Use error channels to propagate failures gracefully without crashing everything.

153-161: Storage comparison logic looks good.

The method correctly extracts the MPT storage content and compares it to the ZK storage hash. No issues found here.

163-211: Fix the race condition when breaking on duplicate keys.

Breaking immediately after unlocking leaves the map unprotected if other goroutines continue writing. Use a temporary boolean or a deferred unlock to avoid data races.

213-241: Duplicate definition of loadZkTrie (without paranoid parameter).

This appears to be an old version of the function. Retain only one version of loadZkTrie to avoid conflicts and confusion.

omerfirmak force-pushed the omerfirmak/checker-cmd branch 17 times, most recently from a9d9968 to 1651970 Compare February 12, 2025 14:19

omerfirmak marked this pull request as ready for review February 17, 2025 11:34

omerfirmak requested a review from Thegaram February 17, 2025 11:34

coderabbitai bot reviewed Feb 17, 2025

View reviewed changes

trie/zk_trie.go Outdated Show resolved Hide resolved

trie/zk_trie.go Outdated Show resolved Hide resolved

coderabbitai bot reviewed Feb 20, 2025

View reviewed changes

cmd/migration-checker/main.go Show resolved Hide resolved

cmd/migration-checker/main.go Outdated Show resolved Hide resolved

trie/zk_trie.go Outdated Show resolved Hide resolved

omerfirmak added 3 commits February 21, 2025 17:26

feat: add migration checker cmd

880fb24

wait for all checkers to finish

4f42703

paranoid flag

12ec3a6

omerfirmak force-pushed the omerfirmak/checker-cmd branch from 81ab44f to 12ec3a6 Compare February 21, 2025 14:26

make linter happy

642c35f

coderabbitai bot reviewed Feb 21, 2025

View reviewed changes

cmd/migration-checker/main.go Show resolved Hide resolved

cmd/migration-checker/main.go Show resolved Hide resolved

coderabbitai bot reviewed Feb 21, 2025

View reviewed changes

cmd/migration-checker/main.go Outdated Show resolved Hide resolved

cmd/migration-checker/main.go Show resolved Hide resolved

print some more stats

8a7e8ca

coderabbitai bot reviewed Mar 11, 2025

View reviewed changes

allow configuring paralelism

d623b3f

coderabbitai bot reviewed Mar 11, 2025

View reviewed changes

add pprof

0eb1fb4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add migration checker cmd #1114

feat: add migration checker cmd #1114

omerfirmak commented Feb 10, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 10, 2025 •

edited

Loading

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot Mar 11, 2025

coderabbitai bot Mar 11, 2025

coderabbitai bot Mar 11, 2025

coderabbitai bot Mar 11, 2025

coderabbitai bot Mar 11, 2025

coderabbitai bot left a comment

		for i := 0; i < runtime.GOMAXPROCS(0)*4; i++ {
		<-trieCheckers

feat: add migration checker cmd #1114

Are you sure you want to change the base?

feat: add migration checker cmd #1114

Conversation

omerfirmak commented Feb 10, 2025 • edited by coderabbitai bot Loading

1. Purpose or design rationale of this PR

2. PR title

3. Deployment tag versioning

4. Breaking change label

Summary by CodeRabbit

coderabbitai bot commented Feb 10, 2025 • edited Loading

Walkthrough

Changes

Sequence Diagram(s)

Poem

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Mar 11, 2025

Choose a reason for hiding this comment

coderabbitai bot Mar 11, 2025

Choose a reason for hiding this comment

coderabbitai bot Mar 11, 2025

Choose a reason for hiding this comment

coderabbitai bot Mar 11, 2025

Choose a reason for hiding this comment

coderabbitai bot Mar 11, 2025

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

omerfirmak commented Feb 10, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 10, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)