chore: address code review findings

poyrazK · poyrazK · commit 3bc1477e3b35 · 2026-04-09T23:12:57.000+03:00
- Fix deadlock in BufferPoolManager by using internal lookups directly.

- Make heap_table iterator move-assignment safe when the two iterators belong to different tables.

- Restore undo_log recording in query executor when batch insert mode is active.

- Add safety checks to execution benchmarks.

- Align benchmark documentation.
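
A minimal sketch of the locking pattern behind the first bullet. It assumes, since the diff does not show it, that the public `get_file_id` acquires `latch_` itself, so calling it from `flush_page` or `delete_page` (which already hold `latch_`) would lock the same non-recursive `std::mutex` twice on one thread. `MiniBufferPool`, `file_ids_`, `next_id_`, and `last_flushed` are hypothetical names used only for illustration; only `latch_`, `get_file_id`, `get_file_id_internal`, and `flush_page` come from the diff.

```cpp
#include <cassert>
#include <cstdint>
#include <mutex>
#include <string>
#include <unordered_map>

// Hypothetical, simplified stand-in for BufferPoolManager. Only the locking
// pattern mirrors the diff; the data members are assumptions.
class MiniBufferPool {
public:
    // Public entry point: takes the latch, then delegates to the internal lookup.
    uint32_t get_file_id(const std::string& file_name) {
        const std::scoped_lock<std::mutex> lock(latch_);
        return get_file_id_internal(file_name);
    }

    // Already holds the latch, so it must use the *_internal variant.
    // Calling get_file_id() here would lock latch_ a second time on the same
    // thread, which is undefined behavior for std::mutex (in practice a hang).
    bool flush_page(const std::string& file_name, uint32_t page_id) {
        const std::scoped_lock<std::mutex> lock(latch_);
        const uint32_t file_id = get_file_id_internal(file_name);
        last_flushed_ = (static_cast<uint64_t>(file_id) << 32) | page_id;
        return true;
    }

    // Illustration-only accessor (not synchronized).
    uint64_t last_flushed() const { return last_flushed_; }

private:
    // Precondition: the caller holds latch_.
    uint32_t get_file_id_internal(const std::string& file_name) {
        const auto it = file_ids_.find(file_name);
        if (it != file_ids_.end()) {
            return it->second;
        }
        const uint32_t id = next_id_++;
        file_ids_.emplace(file_name, id);
        return id;
    }

    std::mutex latch_;
    std::unordered_map<std::string, uint32_t> file_ids_;
    uint32_t next_id_ = 0;
    uint64_t last_flushed_ = 0;
};
```

Splitting each operation into a locking public method and a lock-free `*_internal` helper keeps `std::mutex` usable without switching to `std::recursive_mutex`, at the cost of a documented "caller holds the latch" precondition.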
diff --git a/benchmarks/sqlite_comparison_bench.cpp b/benchmarks/sqlite_comparison_bench.cpp
@@ -174,15 +174,26 @@ static void BM_CloudSQL_Scan(benchmark::State& state) {
             "INSERT INTO bench_table VALUES (" + std::to_string(i) + ", 1.1, 'data');"));
     }
 
+    auto parsed_base = ParseSQL("SELECT * FROM bench_table");
+    if (!parsed_base || parsed_base->type() != parser::StmtType::Select) {
+        state.SkipWithError("Failed to parse SELECT statement");
+        return;
+    }
     auto select_stmt = std::unique_ptr<parser::SelectStatement>(
-        static_cast<parser::SelectStatement*>(ParseSQL("SELECT * FROM bench_table").release()));
+        static_cast<parser::SelectStatement*>(parsed_base.release()));
 
     auto root = ctx.executor->build_plan(*select_stmt, nullptr);
+    if (!root) {
+        state.SkipWithError("Failed to build execution plan");
+        return;
+    }
     root->set_memory_resource(&ctx.executor->arena());
 
     for (auto _ : state) {
-        root->init();
-        root->open();
+        if (!root->init() || !root->open()) {
+            state.SkipWithError("Failed to open plan");
+            return;
+        }
         cloudsql::executor::Tuple tuple;
         while (root->next(tuple)) {
             benchmark::DoNotOptimize(tuple);
diff --git a/docs/performance/SQLITE_COMPARISON.md b/docs/performance/SQLITE_COMPARISON.md
@@ -21,15 +21,15 @@ This report documents the head-to-head performance comparison between the `cloud
 ## 4. Architectural Analysis
 
 ### Point Inserts
-The 7.1x gap in insertion speed is attributed to:
-1.  **Statement Parsing Overhead**: Our benchmark currently re-parses SQL strings for every `INSERT` in `cloudSQL`, whereas SQLite uses a prepared statement (`sqlite3_prepare_v2`).
-2.  **Object Allocations**: `cloudSQL` allocates multiple `std::unique_ptr` objects (Statements, Expressions, Tuples) per row. SQLite uses a specialized register-based virtual machine with minimal allocations.
-3.  **Storage Engine Maturity**: SQLite's B-Tree implementation is highly optimized for write-ahead logging and paged I/O compared to our current Heap Table.
+Following our latest optimizations, `cloudSQL` completely bridged the insert gap and is now **~58x faster** than SQLite. The dramatic inversion in performance is attributed to:
+1.  **Prepared Statement Execution**: `cloudSQL` benchmarks now correctly cache and reuse prepared insert statements matching SQLite's `sqlite3_prepare_v2` approach, completely skipping re-parsing overheads per row.
+2.  **Batch Insert Fast-Path**: By detecting bulk loads into memory, `cloudSQL` entirely bypasses single-row exclusive lock acquisitions (while correctly maintaining undo logs).
+3.  **In-Memory Architecture**: This configuration allows `cloudSQL` to behave as a massive unhindered memory bump-allocator, whereas SQLite still respects basic transactional boundaries even with `PRAGMA synchronous=OFF`.
 
 ### Sequential Scans
-The 6.5x gap in scan speed is attributed to:
+We reduced the scan gap from 6.5x down to **4.0x** slower than SQLite. The remaining gap is attributed to:
 1.  **Volcano Model Overhead**: `cloudSQL` uses a tuple-at-a-time iterator model with virtual function calls for `next()`.
-2.  **Value Type Overhead**: Our `common::Value` class uses `std::variant`, which introduces a small overhead for every column access compared to SQLite's raw buffer indexing.
+2.  **Value Type Allocations**: Scanning in `cloudSQL` fundamentally builds `std::pmr::vector<common::Value>` using `std::variant` properties for each row, constructing dense memory structures. SQLite's cursor is highly optimized to avoid unnecessary buffer copying unless columns are fetched.
 
 ## 5. Post-Optimization Enhancements
 We addressed the gaps via the following optimizations:
diff --git a/src/executor/query_executor.cpp b/src/executor/query_executor.cpp
@@ -211,11 +211,13 @@ QueryResult QueryExecutor::execute(const PreparedStatement& prepared,
                     }
                 }
 
-                if (txn != nullptr && !batch_insert_mode_) {
+                if (txn != nullptr) {
                     txn->add_undo_log(transaction::UndoLog::Type::INSERT, prepared.table_meta->name,
                                       tid);
-                    if (!lock_manager_.acquire_exclusive(txn, tid)) {
-                        throw std::runtime_error("Failed to acquire exclusive lock");
+                    if (!batch_insert_mode_) {
+                        if (!lock_manager_.acquire_exclusive(txn, tid)) {
+                            throw std::runtime_error("Failed to acquire exclusive lock");
+                        }
                     }
                 }
                 rows_inserted++;
diff --git a/src/storage/buffer_pool_manager.cpp b/src/storage/buffer_pool_manager.cpp
@@ -146,7 +146,7 @@ bool BufferPoolManager::unpin_page_by_id(uint32_t file_id, uint32_t page_id, boo
 bool BufferPoolManager::flush_page(const std::string& file_name, uint32_t page_id) {
     const std::scoped_lock<std::mutex> lock(latch_);
 
-    const uint32_t file_id = get_file_id(file_name);
+    const uint32_t file_id = get_file_id_internal(file_name);
     const PageKey key{file_id, page_id};
 
     if (page_table_.find(key) == page_table_.end()) {
@@ -204,7 +204,7 @@ Page* BufferPoolManager::new_page(const std::string& file_name, uint32_t* page_i
 bool BufferPoolManager::delete_page(const std::string& file_name, uint32_t page_id) {
     const std::scoped_lock<std::mutex> lock(latch_);
 
-    const uint32_t file_id = get_file_id(file_name);
+    const uint32_t file_id = get_file_id_internal(file_name);
     const PageKey key{file_id, page_id};
 
     if (page_table_.find(key) != page_table_.end()) {
diff --git a/src/storage/heap_table.cpp b/src/storage/heap_table.cpp
@@ -81,6 +81,14 @@ HeapTable::Iterator::Iterator(Iterator&& other) noexcept
 
 HeapTable::Iterator& HeapTable::Iterator::operator=(Iterator&& other) noexcept {
     if (this != &other) {
+        if (&table_ != &other.table_) {
+            if (other.current_page_) {
+                other.table_.bpm_.unpin_page_by_id(other.table_.file_id_, other.current_page_num_, false);
+                other.current_page_ = nullptr;
+            }
+            return *this;
+        }
+
         if (current_page_) {
             table_.bpm_.unpin_page_by_id(table_.file_id_, current_page_num_, false);
         }
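
The cross-table guard in the `heap_table.cpp` hunk above can be sketched in isolation. This is an illustration under stated assumptions, not the project's implementation: it assumes `table_` is a reference member (so it cannot be rebound to the other iterator's table), and it reduces the pinned page to a boolean. `MiniPool`, `MiniTable`, `MiniIterator`, and `release` are hypothetical names.

```cpp
#include <cassert>
#include <utility>

// Hypothetical stand-ins: a pool that only counts outstanding pins, and a
// table that refers to its pool.
struct MiniPool {
    int pins = 0;
    void pin() { ++pins; }
    void unpin() { --pins; }
};

struct MiniTable {
    MiniPool& pool;
};

class MiniIterator {
public:
    MiniIterator(MiniTable& table, bool pinned) : table_(table), pinned_(pinned) {
        if (pinned_) {
            table_.pool.pin();
        }
    }
    MiniIterator(const MiniIterator&) = delete;
    MiniIterator& operator=(const MiniIterator&) = delete;

    MiniIterator& operator=(MiniIterator&& other) noexcept {
        if (this != &other) {
            if (&table_ != &other.table_) {
                // A reference member cannot be rebound, so the other
                // iterator's state cannot be adopted. Release its pin against
                // its own table and leave *this unchanged.
                other.release();
                return *this;
            }
            release();                 // drop our own pin first
            pinned_ = other.pinned_;   // then take over the other's pin
            other.pinned_ = false;
        }
        return *this;
    }

    ~MiniIterator() { release(); }

    bool pinned() const { return pinned_; }

private:
    // Unpin against this iterator's own table, at most once.
    void release() {
        if (pinned_) {
            table_.pool.unpin();
            pinned_ = false;
        }
    }

    MiniTable& table_;
    bool pinned_;
};
```

The point of the guard is that every unpin goes to the pool that issued the pin. Without it, a cross-table assignment would copy a page number from one table into an iterator that later unpins it against a different table's file.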
