Retry auto commit upserts on out-of-memory error by xiexiaoy · Pull Request #369 · eloqdata/eloqdoc

xiexiaoy · 2025-12-15T07:10:23Z

Summary by CodeRabbit

Bug Fixes
- Added automatic retry (up to 5 attempts) with brief backoff for transient write/command failures (e.g., write conflicts, memory limits), with retry logging.
- Refused new sessions when connection limits are exceeded to prevent over-capacity acceptance.
Chores
- Improved diagnostic logging to include transaction identifiers and richer error details for KV/batch operations and commit failures.

coderabbitai · 2025-12-15T07:10:39Z

Walkthrough

Adds per-call retry loops (up to 5 attempts) around EloqDoc-enabled command execution and data-modifying receiver paths, augments Eloq module logs to include transaction IDs and error details, and enforces an early return when session connections exceed the configured maximum.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| Retry mechanism for Eloq write paths: `src/mongo/db/service_entry_point_common.cpp` | Introduces `RETRY_NUM = 5` and wraps Eloq-enabled atomic command branches plus `receivedInsert`, `receivedUpdate`, and `receivedDelete` in per-attempt try/catch loops. Each iteration manages a WriteUnitOfWork and commits per-iteration; retries occur on `WriteConflict` or `ExceededMemoryLimit` with short sleep and logging. Other exceptions are rethrown or logged and break the loop. |
| Eloq record store logging: `src/mongo/db/modules/eloq/src/eloq_record_store.cpp` | Adds a warning log when `_insertRecords` observes a non-`NO_ERROR` from `batchGetKV`, including table name, transaction number, and the error message. No control-flow changes. |
| Eloq recovery unit logging: `src/mongo/db/modules/eloq/src/eloq_recovery_unit.cpp` | Enhances logs in `batchGetKV` and `_txnClose` to include the transaction id (txn) alongside existing error details; captures txn locally for clearer logging. No behavioral changes. |
| Session start capacity check: `src/mongo/transport/service_entry_point_impl.cpp` | Activates an early return when current connections exceed the maximum allowed, logging an explicit "connection refused because too many open connections" message and preventing further session acceptance/initialization. |

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~40 minutes

Areas needing extra attention:
- RecoveryUnit / WriteUnitOfWork lifecycle and commit semantics across retry iterations in service_entry_point_common.cpp.
- Potential for duplicate side-effects or repeated application of operations when retries re-enter the same code paths.
- Correct inclusion and formatting of txn ids in Eloq logs and consistency of log contexts.
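
The lifecycle concern in the first two items can be illustrated with a small self-contained model. This is only a sketch under stated assumptions: `Store`, `ScopedUnitOfWork`, and `insertBatchWithRetry` are hypothetical stand-ins, not the MongoDB `WriteUnitOfWork` or any Eloq API. It assumes a unit of work that discards staged writes unless it is committed, and shows that creating a fresh one per attempt keeps a failed attempt's partial writes out of the final result.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical in-memory store; not the Eloq or MongoDB storage API.
struct Store {
    std::vector<std::string> committed;
};

// Stand-in for a unit of work: writes are staged and reach the store only on
// commit(). Destroying it without commit() discards the staged writes.
class ScopedUnitOfWork {
public:
    explicit ScopedUnitOfWork(Store& store) : _store(store) {}
    void insert(const std::string& doc) { _staged.push_back(doc); }
    void commit() {
        _store.committed.insert(_store.committed.end(), _staged.begin(), _staged.end());
        _staged.clear();
    }

private:
    Store& _store;
    std::vector<std::string> _staged;
};

// Inserts the batch {"a", "b"}. While failuresLeft is positive, an attempt
// throws after staging "a", simulating a transient mid-batch failure.
// Returns the number of attempts used.
inline int insertBatchWithRetry(Store& store, int failuresLeft, int maxAttempts) {
    for (int attempt = 1; attempt <= maxAttempts; ++attempt) {
        try {
            ScopedUnitOfWork wuow(store);  // fresh unit of work per attempt
            wuow.insert("a");
            if (failuresLeft > 0) {
                --failuresLeft;
                throw std::runtime_error("simulated transient failure");
            }
            wuow.insert("b");
            wuow.commit();
            return attempt;  // success: stop so the batch is not applied twice
        } catch (const std::runtime_error&) {
            if (attempt == maxAttempts) {
                throw;  // retries exhausted: propagate
            }
        }
    }
    return 0;  // reached only if maxAttempts < 1
}
```

With two simulated failures the batch commits on the third attempt and the store holds "a" and "b" exactly once. If the unit of work were shared across attempts, or if the loop kept running after a successful commit, "a" could be applied more than once.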

Possibly related PRs

Remove the retry in EloqCatalogRecordStore::insertRecord() on failed to create table. #222 — Moves retry responsibility for Eloq catalog/table creation; directly related to retry-placement changes and may overlap.
Retry after sleep random milliseconds when failed to create collection #226 — Alters retry/sleep behavior in Eloq paths; potentially overlapping with the introduced backoff/retry semantics.

Suggested reviewers

lzxddz

Poem

🐇 I hop five times when writes collide,
I log each txn and take a quiet slide.
A pause, a breath, then one more try —
resilient hops beneath the sky. 🥕

Pre-merge checks and finishing touches

❌ Failed checks (2 warnings)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Title check | ⚠️ Warning | The PR title focuses on retrying upserts on out-of-memory errors, but the changeset includes retry mechanisms for broader command execution paths, insert/update/delete operations, logging enhancements across multiple files, and connection limit handling; it is only partially aligned with the title's focus. | Update the title to better reflect the full scope, such as: 'Add retry mechanisms for EloqDoc commands and data operations on transient errors' or similar to capture the broader changes across multiple modules. |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%. | You can run `@coderabbitai generate docstrings` to improve docstring coverage. |

✅ Passed checks (1 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |

📜 Recent review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c469469 and 20fc04a.

📒 Files selected for processing (1)

src/mongo/transport/service_entry_point_impl.cpp (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

src/mongo/transport/service_entry_point_impl.cpp

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (1)

src/mongo/db/service_entry_point_common.cpp (1)

1181-1280: Consider extracting a reusable retry helper.

The retry logic is duplicated across receivedInsert, receivedUpdate, and receivedDelete. Consider extracting a template or lambda-based helper to reduce duplication and ensure consistency:

template <typename Op>
void executeWithRetry(OperationContext* opCtx, Op&& operation, const char* opName) {
    int retry = RETRY_NUM;
    while (retry-- > 0) {
        try {
            WriteUnitOfWork wuow(opCtx);
            operation();
            if (opCtx->getRecoveryUnitState() == WriteUnitOfWork::kActiveUnitOfWork) {
                wuow.commit();
            }
            break;
        } catch (const DBException& ex) {
            if ((ex.code() == ErrorCodes::WriteConflict ||
                 ex.code() == ErrorCodes::ExceededMemoryLimit) && retry > 0) {
                LOG(1) << opName << " throw DBException " << ex.what() 
                       << " retrying " << (RETRY_NUM - retry) << "/" << RETRY_NUM;
                continue;
            }
            LOG(1) << opName << " throw DBException " << ex.what();
            throw;
        }
    }
}
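
The helper above depends on MongoDB types, so it cannot be exercised in isolation. A standalone approximation of the same control flow might look like the following; `ErrorCode`, `DBExceptionStandIn`, `kRetryNum`, and this `executeWithRetry` signature are hypothetical stand-ins introduced for illustration, and the unit-of-work handling and logging are left out.

```cpp
#include <cassert>
#include <functional>
#include <stdexcept>
#include <string>

// Hypothetical stand-ins for the MongoDB types named in the review.
enum class ErrorCode { WriteConflict, ExceededMemoryLimit, BadValue };

struct DBExceptionStandIn : std::runtime_error {
    ErrorCode code;
    DBExceptionStandIn(ErrorCode c, const std::string& msg)
        : std::runtime_error(msg), code(c) {}
};

constexpr int kRetryNum = 5;  // mirrors RETRY_NUM = 5 from the PR

inline bool isRetryable(ErrorCode c) {
    return c == ErrorCode::WriteConflict || c == ErrorCode::ExceededMemoryLimit;
}

// Runs `operation` up to kRetryNum times and returns the number of attempts used.
inline int executeWithRetry(const std::function<void()>& operation) {
    for (int attempt = 1; attempt <= kRetryNum; ++attempt) {
        try {
            operation();
            return attempt;  // success: stop so the write is not applied twice
        } catch (const DBExceptionStandIn& ex) {
            if (isRetryable(ex.code) && attempt < kRetryNum) {
                continue;  // transient failure with attempts left
            }
            throw;  // non-retryable, or retries exhausted: propagate
        }
    }
    return kRetryNum;  // not reached; every iteration returns, continues, or throws
}
```

Counting attempts upward instead of decrementing `retry` is a design choice of this sketch; it keeps the "attempts left" test and any log message in terms of a single variable.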

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 83690ed and e2bde35.

📒 Files selected for processing (2)

src/mongo/db/modules/eloq/src/eloq_recovery_unit.cpp (1 hunks)
src/mongo/db/service_entry_point_common.cpp (5 hunks)

🧰 Additional context used

🧠 Learnings (4)

📓 Common learnings

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 211
File: src/mongo/db/modules/eloq/cmake/build_eloq_log_service.cmake:26-80
Timestamp: 2025-09-25T12:24:06.434Z
Learning: The user githubzilla implemented a fix in commit fe98aaf to address the MEMORY state incorrectly triggering RocksDB discovery in build_eloq_log_service.cmake. This was a control flow issue where MEMORY builds were falling into an else branch that still attempted RocksDB discovery.

📚 Learning: 2025-09-25T12:24:06.434Z

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 211
File: src/mongo/db/modules/eloq/cmake/build_eloq_log_service.cmake:26-80
Timestamp: 2025-09-25T12:24:06.434Z
Learning: The user githubzilla implemented a fix in commit fe98aaf to address the MEMORY state incorrectly triggering RocksDB discovery in build_eloq_log_service.cmake. This was a control flow issue where MEMORY builds were falling into an else branch that still attempted RocksDB discovery.

Applied to files:

src/mongo/db/modules/eloq/src/eloq_recovery_unit.cpp

📚 Learning: 2025-11-28T10:55:21.020Z

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 279
File: src/mongo/db/modules/eloq/CMakeLists.txt:54-55
Timestamp: 2025-11-28T10:55:21.020Z
Learning: In src/mongo/db/modules/eloq/CMakeLists.txt, the CMake options EXT_TX_PROC_ENABLED, FORK_HM_PROCESS, and STATISTICS are used by the data_substrate submodule build system and should be retained as configuration interface options.

Applied to files:

src/mongo/db/modules/eloq/src/eloq_recovery_unit.cpp

📚 Learning: 2025-11-14T03:10:22.027Z

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 279
File: src/mongo/db/dbmain.cpp:0-0
Timestamp: 2025-11-14T03:10:22.027Z
Learning: In this repository, avoid defining --config via gflags in src/mongo/db/dbmain.cpp; MongoDB’s options_parser owns --config. Prefer gflags flags with a distinct prefix (e.g., --data_substrate_config) parsed before options_parser, with options_parser configured to allow unregistered options.

Applied to files:

src/mongo/db/service_entry_point_common.cpp

🔇 Additional comments (2)

src/mongo/db/modules/eloq/src/eloq_recovery_unit.cpp (1)

484-494: LGTM - OUT_OF_MEMORY added as retryable error.

This correctly groups OUT_OF_MEMORY with other transient/retryable errors for logging purposes, consistent with the retry mechanism being added in service_entry_point_common.cpp.

src/mongo/db/service_entry_point_common.cpp (1)

107-108: LGTM - Retry constant defined.

Good practice to define the retry count as a named constant.

src/mongo/db/service_entry_point_common.cpp

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (1)

src/mongo/db/service_entry_point_common.cpp (1)
107-107: Retry loops: missing break on success and exceptions are swallowed; also hard‑coded retry count

Across receivedInsert, receivedUpdate, and receivedDelete:

After a successful wuow.commit(), the loop does not break, so the same write is re‑executed up to RETRY_NUM times. For non‑idempotent inserts/updates/deletes this can cause duplicate or repeated modifications.

In the catch (const DBException& ex) block, when the error is non‑retryable or retries are exhausted, you just log and break. The function then returns as if the operation succeeded, so callers (and LastError) won’t see the failure. This is a behavior change from the previous code where the exception propagated.

You now catch DBException here, so the exception never reaches the outer handleRequest catch (AssertionException&), further hiding failures.

The log message hard‑codes 5 and (5 - retry), which will diverge from the actual retry count if RETRY_NUM is changed.

These issues were already called out in earlier review comments and still apply here. Please adjust the loops so that:

On success, you immediately break out of the loop.

On a retryable error (WriteConflict/ExceededMemoryLimit) with attempts left, you log and continue.

Otherwise, you log and rethrow with throw; to propagate the failure.

Use RETRY_NUM consistently in the log message instead of the literal 5.

For example, for receivedInsert (apply the same pattern to update/delete):
-    int retry = RETRY_NUM;
-    while (retry-- > 0) {
-        try {
-
-            WriteUnitOfWork wuow(opCtx);
-            performInserts(opCtx, insertOp);
-            if (opCtx->getRecoveryUnitState() == WriteUnitOfWork::kActiveUnitOfWork) {
-                wuow.commit();
-            }
-        } catch (const DBException& ex) {
-            if ((ex.code() == ErrorCodes::WriteConflict ||
-                 ex.code() == ErrorCodes::ExceededMemoryLimit) &&
-                retry > 0) {
-                LOG(1) << "performInserts throw DBException " << ex.what() << " retrying "
-                       << (5 - retry) << "/5";
-                continue;
-            } else {
-                LOG(1) << "performInserts throw DBException " << ex.what();
-                break;
-            }
-        }
-    }
+    int retry = RETRY_NUM;
+    while (retry-- > 0) {
+        try {
+            WriteUnitOfWork wuow(opCtx);
+            performInserts(opCtx, insertOp);
+            if (opCtx->getRecoveryUnitState() == WriteUnitOfWork::kActiveUnitOfWork) {
+                wuow.commit();
+            }
+            break;  // Success – exit retry loop
+        } catch (const DBException& ex) {
+            if ((ex.code() == ErrorCodes::WriteConflict ||
+                 ex.code() == ErrorCodes::ExceededMemoryLimit) &&
+                retry > 0) {
+                LOG(1) << "performInserts throw DBException " << ex.what()
+                       << " retrying " << (RETRY_NUM - retry) << "/" << RETRY_NUM;
+                continue;
+            } else {
+                LOG(1) << "performInserts throw DBException " << ex.what();
+                throw;  // Propagate final / non‑retryable failure
+            }
+        }
+    }
Same structure should be applied to the update and delete handlers to avoid duplicate writes and to preserve error propagation semantics.

Also applies to: 1181-1200, 1226-1244, 1260-1278
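
The counter arithmetic behind the suggested `(RETRY_NUM - retry) << "/" << RETRY_NUM` message can be checked with a small worked example. This is a sketch, not code from the PR: `retryLabels` is a hypothetical helper that collects the labels the retry branch would log if every attempt failed with a retryable error.

```cpp
#include <cassert>
#include <string>
#include <vector>

constexpr int RETRY_NUM = 5;

// Returns the "n/total" labels the retry branch would log if every attempt
// failed with a retryable error.
inline std::vector<std::string> retryLabels() {
    std::vector<std::string> labels;
    int retry = RETRY_NUM;
    while (retry-- > 0) {
        // Inside the body, retry counts the attempts still available: 4, 3, 2, 1, 0.
        if (retry > 0) {
            labels.push_back(std::to_string(RETRY_NUM - retry) + "/" +
                             std::to_string(RETRY_NUM));
        }
        // retry == 0 is the final attempt: nothing is logged as a retry, and
        // the suggested loop rethrows at this point.
    }
    return labels;
}
```

For `RETRY_NUM = 5` this yields "1/5" through "4/5". Deriving both numbers from `RETRY_NUM` means the labels stay correct if the constant changes, which the literal `5` would not.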

🧹 Nitpick comments (1)

src/mongo/db/service_entry_point_common.cpp (1)

46-46: Unused write_conflict_exception include

This header isn’t used in this TU (we only reference ErrorCodes::WriteConflict). If you don’t plan to use writeConflictRetry / WriteConflictException here, consider dropping the include to avoid an extra dependency.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e2bde35 and 51545be.

📒 Files selected for processing (1)

src/mongo/db/service_entry_point_common.cpp (5 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-11-14T03:10:22.027Z

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 279
File: src/mongo/db/dbmain.cpp:0-0
Timestamp: 2025-11-14T03:10:22.027Z
Learning: In this repository, avoid defining --config via gflags in src/mongo/db/dbmain.cpp; MongoDB’s options_parser owns --config. Prefer gflags flags with a distinct prefix (e.g., --data_substrate_config) parsed before options_parser, with options_parser configured to allow unregistered options.

Applied to files:

src/mongo/db/service_entry_point_common.cpp

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (3)

src/mongo/db/service_entry_point_common.cpp (3)
1195-1217: Critical: Exception silently swallowed on failure.

The retry loop has the same critical issue flagged in the previous review: when retries are exhausted or a non-retryable exception occurs, the code breaks without re-throwing, so clients won't know the operation failed.

Apply this diff to fix:
             } else {
                 LOG(1) << "performInserts throw DBException " << ex.what();
-                break;
+                throw;  // Re-throw to propagate error to caller
             }
1241-1262: Critical: Exception silently swallowed on failure.

This has the same critical bug as the insert retry loop: exceptions are silently swallowed when retries are exhausted or a non-retryable exception occurs.

Apply this diff to fix:
             } else {
                 LOG(1) << "performUpdates throw DBException " << ex.what();
-                break;
+                throw;  // Re-throw to propagate error to caller
             }
1276-1297: Critical: Exception silently swallowed on failure.

This has the same critical bug as the insert and update retry loops: exceptions are silently swallowed when retries are exhausted or a non-retryable exception occurs.

Apply this diff to fix:
             } else {
                 LOG(1) << "performDeletes throw DBException " << ex.what();
-                break;
+                throw;  // Re-throw to propagate error to caller
             }

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 7e249eb and 0e5fe19.

📒 Files selected for processing (1)

src/mongo/db/service_entry_point_common.cpp (6 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-11-14T03:10:22.027Z

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 279
File: src/mongo/db/dbmain.cpp:0-0
Timestamp: 2025-11-14T03:10:22.027Z
Learning: In this repository, avoid defining --config via gflags in src/mongo/db/dbmain.cpp; MongoDB’s options_parser owns --config. Prefer gflags flags with a distinct prefix (e.g., --data_substrate_config) parsed before options_parser, with options_parser configured to allow unregistered options.

Applied to files:

src/mongo/db/service_entry_point_common.cpp

🔇 Additional comments (2)

src/mongo/db/service_entry_point_common.cpp (2)

46-46: LGTM!

The include is necessary for handling WriteConflict exceptions in the retry logic.

107-107: LGTM!

The retry constant is appropriately defined in the anonymous namespace.

src/mongo/db/service_entry_point_common.cpp

coderabbitai

Actionable comments posted: 2

♻️ Duplicate comments (3)

src/mongo/db/service_entry_point_common.cpp (3)
1232-1235: Critical: Exception still swallowed - break should be throw.

The past review flagged this issue (marked as addressed) but the code still shows break instead of throw. When a non-retryable exception occurs or retries are exhausted, the exception is silently swallowed and the function returns normally.

Apply this diff:
             } else {
                 LOG(1) << "performInserts throw DBException " << ex.what();
-                break;
+                throw;
             }
1278-1281: Critical: Same issue - break should be throw.

Same problem as receivedInsert. The exception is silently swallowed when retries are exhausted or a non-retryable exception occurs.

Apply this diff:
             } else {
                 LOG(1) << "performUpdates throw DBException " << ex.what();
-                break;
+                throw;
             }
1314-1317: Critical: Same issue - break should be throw.

Same problem as the other receiver functions. The exception is silently swallowed.

Apply this diff:
             } else {
                 LOG(1) << "performDeletes throw DBException " << ex.what();
-                break;
+                throw;
             }

🧹 Nitpick comments (1)

src/mongo/db/service_entry_point_common.cpp (1)

1214-1237: Consider extracting a common retry helper to ensure consistency.

There are two slightly different retry patterns in this file:

- runCommandImpl: doesn't check retry > 0 in catch, throws for non-retryable
- receivedInsert/Update/Delete: checks retry > 0, but breaks (should throw)

Both have bugs that result in swallowed exceptions. A common helper function would prevent these inconsistencies:

template <typename Func>
void retryOnTransientError(OperationContext* opCtx, const char* opName, Func&& fn) {
    for (int attempt = 1; attempt <= RETRY_NUM; ++attempt) {
        try {
            fn();
            return;  // Success
        } catch (const DBException& ex) {
            if ((ex.code() == ErrorCodes::WriteConflict ||
                 ex.code() == ErrorCodes::ExceededMemoryLimit) &&
                attempt < RETRY_NUM) {
                LOG(1) << opName << " throw DBException " << ex.what()
                       << " retrying " << attempt << "/" << RETRY_NUM;
                opCtx->sleepFor(Milliseconds(1));
                continue;
            }
            throw;  // Re-throw on final attempt or non-retryable
        }
    }
}
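
One way the first pattern can lose an error is sketched below. The code is an illustration under assumptions, not the PR's `runCommandImpl`: `TransientError`, `runWithoutAttemptCheck`, and `runWithAttemptCheck` are hypothetical names, and the buggy variant returns a `bool` only so the silent exit is observable (the real handlers return `void`).

```cpp
#include <cassert>
#include <functional>
#include <stdexcept>

// Hypothetical stand-in for a retryable DBException (e.g. WriteConflict).
struct TransientError : std::runtime_error {
    TransientError() : std::runtime_error("transient failure") {}
};

constexpr int kRetryNum = 5;

// Buggy shape: retries on every transient error without checking whether
// attempts remain, so the loop can end with the last failure discarded.
inline bool runWithoutAttemptCheck(const std::function<void()>& op) {
    int retry = kRetryNum;
    while (retry-- > 0) {
        try {
            op();
            return true;  // succeeded
        } catch (const TransientError&) {
            continue;  // on the final attempt this falls out of the loop
        }
    }
    return false;  // every attempt failed, yet nothing was thrown
}

// Corrected shape: the last failure is rethrown to the caller.
inline void runWithAttemptCheck(const std::function<void()>& op) {
    int retry = kRetryNum;
    while (retry-- > 0) {
        try {
            op();
            return;
        } catch (const TransientError&) {
            if (retry > 0) {
                continue;  // attempts left: try again
            }
            throw;  // retries exhausted: propagate
        }
    }
}
```

With an operation that always throws `TransientError`, the first function returns normally after five attempts, while the second lets the fifth failure reach the caller.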

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 62db8bc and 1fc5831.

📒 Files selected for processing (3)

src/mongo/db/modules/eloq/src/eloq_record_store.cpp (1 hunks)
src/mongo/db/modules/eloq/src/eloq_recovery_unit.cpp (2 hunks)
src/mongo/db/service_entry_point_common.cpp (7 hunks)

🚧 Files skipped from review as they are similar to previous changes (2)

src/mongo/db/modules/eloq/src/eloq_recovery_unit.cpp
src/mongo/db/modules/eloq/src/eloq_record_store.cpp

🧰 Additional context used

🧠 Learnings (2)

📓 Common learnings

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 211
File: src/mongo/db/modules/eloq/cmake/build_eloq_log_service.cmake:26-80
Timestamp: 2025-09-25T12:24:06.434Z
Learning: The user githubzilla implemented a fix in commit fe98aaf to address the MEMORY state incorrectly triggering RocksDB discovery in build_eloq_log_service.cmake. This was a control flow issue where MEMORY builds were falling into an else branch that still attempted RocksDB discovery.

📚 Learning: 2025-11-14T03:10:22.027Z

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 279
File: src/mongo/db/dbmain.cpp:0-0
Timestamp: 2025-11-14T03:10:22.027Z
Learning: In this repository, avoid defining --config via gflags in src/mongo/db/dbmain.cpp; MongoDB’s options_parser owns --config. Prefer gflags flags with a distinct prefix (e.g., --data_substrate_config) parsed before options_parser, with options_parser configured to allow unregistered options.

Applied to files:

src/mongo/db/service_entry_point_common.cpp

🔇 Additional comments (2)

src/mongo/db/service_entry_point_common.cpp (2)

46-46: LGTM!

Include is necessary for the WriteConflict error code used in the retry logic.

107-108: LGTM!

Defining the retry count as a named constant is good practice for maintainability.

src/mongo/db/service_entry_point_common.cpp

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

src/mongo/transport/service_entry_point_impl.cpp (1)

116-124: Minor optimization: Skip coroutine configuration for rejected connections.

The coroutine executor configuration executes even when connectionCount > _maxNumConnections, performing unnecessary work for connections that will be rejected at line 134. Consider moving this block after the capacity check to avoid wasted cycles.

     }
 
-    if (_coroutineExecutor) {
-        MONGO_LOG(0) << "use coroutine service executor";
-        ssm->setServiceExecutor(_coroutineExecutor.get());
-
-        // work balance
-        size_t targetThreadGroupId = connectionCount % serverGlobalParams.reservedThreadNum;
-        ssm->setThreadGroupId(targetThreadGroupId);
-        MONGO_LOG(0) << "Current ssm is assigned to thread group " << targetThreadGroupId;
-    }
-
     // Checking if we successfully added a connection above. Separated from the lock so we don't log
     // while holding it.
     if (connectionCount > _maxNumConnections) {
         if (!quiet) {
             // log() << "connection refused because too many open connections: " << connectionCount;
             log() << "too many open connections: " << connectionCount;
         }
 
         return;
     }
 
+    if (_coroutineExecutor) {
+        MONGO_LOG(0) << "use coroutine service executor";
+        ssm->setServiceExecutor(_coroutineExecutor.get());
+
+        // work balance
+        size_t targetThreadGroupId = connectionCount % serverGlobalParams.reservedThreadNum;
+        ssm->setThreadGroupId(targetThreadGroupId);
+        MONGO_LOG(0) << "Current ssm is assigned to thread group " << targetThreadGroupId;
+    }
+

📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d6487d9 and c469469.

📒 Files selected for processing (1)

src/mongo/transport/service_entry_point_impl.cpp (1 hunks)

🧰 Additional context used

🧠 Learnings (1)

📓 Common learnings

Learnt from: githubzilla
Repo: eloqdata/eloqdoc PR: 211
File: src/mongo/db/modules/eloq/cmake/build_eloq_log_service.cmake:26-80
Timestamp: 2025-09-25T12:24:06.434Z
Learning: The user githubzilla implemented a fix in commit fe98aaf to address the MEMORY state incorrectly triggering RocksDB discovery in build_eloq_log_service.cmake. This was a control flow issue where MEMORY builds were falling into an else branch that still attempted RocksDB discovery.

🔇 Additional comments (1)

src/mongo/transport/service_entry_point_impl.cpp (1)

134-134: Early return prevents critical bugs, socket cleanup verified.

The early return is essential and correct. Without it, the code would:

Log "connection accepted" despite rejecting the connection (incorrect behavior).

Set up a cleanup hook that calls _sessions.erase(ssmIt) on an uninitialized iterator (undefined behavior/crash).

Start an untracked session (resource leak).

Socket cleanup is properly handled when returning early: the session object goes out of scope, triggering ~Session(), which closes the socket via ASIO. The explicit releaseSessionHandler() call in the cleanup hook (line 158) simply nulls the pointer and is not necessary for socket closure.
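
The ordering described above (check capacity first, register and start the session only afterwards) can be sketched with a minimal model. `TrackedSession` and `SessionRegistry` are hypothetical stand-ins, not the MongoDB transport classes, and the sketch omits locking, logging, and the cleanup hook.

```cpp
#include <cassert>
#include <cstddef>
#include <list>
#include <string>

// Hypothetical stand-in for a tracked session; not the MongoDB transport API.
struct TrackedSession {
    std::string remote;
};

class SessionRegistry {
public:
    explicit SessionRegistry(std::size_t maxConnections) : _max(maxConnections) {}

    // Returns true if the session was accepted and registered.
    bool startSession(const std::string& remote) {
        const std::size_t connectionCount = _sessions.size() + 1;
        if (connectionCount > _max) {
            ++_refused;
            return false;  // early return: nothing is registered or started
        }
        // Only accepted sessions are tracked, so any later cleanup step
        // operates on an entry that actually exists.
        _sessions.push_back(TrackedSession{remote});
        return true;
    }

    std::size_t size() const { return _sessions.size(); }
    std::size_t refused() const { return _refused; }

private:
    std::size_t _max;
    std::size_t _refused = 0;
    std::list<TrackedSession> _sessions;
};
```

With a limit of two, a third `startSession` call returns `false` and leaves the registry at two tracked sessions.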

xiexiaoy force-pushed the retry_upserts branch from e2bde35 to 51545be Compare December 15, 2025 07:11

xiexiaoy added the trigger-ci label Dec 15, 2025

coderabbitai bot reviewed Dec 15, 2025

coderabbitai bot reviewed Dec 15, 2025

xiexiaoy force-pushed the retry_upserts branch from 51545be to 7e249eb Compare December 15, 2025 07:16

xiexiaoy marked this pull request as draft December 15, 2025 07:31

xiexiaoy force-pushed the retry_upserts branch from 7e249eb to 0e5fe19 Compare December 15, 2025 07:54

xiexiaoy marked this pull request as ready for review December 15, 2025 10:39

coderabbitai bot reviewed Dec 15, 2025

xiexiaoy marked this pull request as draft December 15, 2025 10:43

xiexiaoy changed the title ~~Retry auto commit upserts~~ Retry auto commit upserts on out-of-memory error Dec 16, 2025

xiexiaoy self-assigned this Dec 16, 2025

xiexiaoy force-pushed the retry_upserts branch 4 times, most recently from 8861aa0 to 62db8bc Compare December 16, 2025 09:38

xiexiaoy marked this pull request as ready for review December 16, 2025 09:41

xiexiaoy mentioned this pull request Dec 16, 2025

[Bug]: bench_doc Deleting 10000 documents (in batches of 500) get stuck eloqdata/tx_service#294

Closed

xiexiaoy marked this pull request as draft December 16, 2025 10:35

xiexiaoy force-pushed the retry_upserts branch 2 times, most recently from 4214dca to 1fc5831 Compare December 16, 2025 14:16

xiexiaoy marked this pull request as ready for review December 16, 2025 14:16

coderabbitai bot reviewed Dec 16, 2025

Retry auto_commit upserts

d6487d9

xiexiaoy force-pushed the retry_upserts branch from 1fc5831 to d6487d9 Compare December 16, 2025 16:13

xiexiaoy requested a review from lzxddz December 17, 2025 02:56

xiexiaoy linked an issue Dec 17, 2025 that may be closed by this pull request

[Bug]: Failed to insert batch: operation failed after 4 retries eloqdata/tx_service#278

Closed

lzxddz approved these changes Dec 17, 2025

Fix ServiceStateMachine cleanupHook _sessions.erase(ssmIt) ssmIt invalid

20fc04a

xiexiaoy force-pushed the retry_upserts branch from c469469 to 20fc04a Compare December 17, 2025 04:21

coderabbitai bot reviewed Dec 17, 2025

xiexiaoy merged commit 80d8852 into eloqdata:main Dec 17, 2025
2 of 3 checks passed

xiexiaoy deleted the retry_upserts branch December 17, 2025 06:12

xiexiaoy merged 2 commits into eloqdata:main from xiexiaoy:retry_upserts
