Boosting Entity Framework Bulk Insert Performance: Strategies and Benchmarks
Efficiently inserting large datasets into Entity Framework is crucial for avoiding performance bottlenecks. This article tackles the challenge of inserting 4000 records within a transaction, a scenario prone to incomplete transactions.
Avoiding the Pitfalls of Frequent SaveChanges()
Calls
The primary performance drain stems from repeatedly calling SaveChanges()
for each record. This approach significantly slows down the process. Here's how to optimize:
1. Single SaveChanges()
Call: Execute a single SaveChanges()
call after all records are added.
2. Batched SaveChanges()
Calls: Insert records in batches (e.g., 100 or 1000) and call SaveChanges()
after each batch.
3. Context Disposal with Batched SaveChanges()
: Combine batching with creating a new database context after each SaveChanges()
call. This clears the context, further enhancing performance.
Optimized Bulk Insert Pattern:
This code illustrates the recommended approach:
<code class="language-csharp">using (TransactionScope scope = new TransactionScope()) { using (MyDbContext context = new MyDbContext()) { context.Configuration.AutoDetectChangesEnabled = false; int count = 0; int commitCount = 100; // Adjust as needed foreach (var entity in someCollection) { count++; context.Set<entity>().Add(entity); if (count % commitCount == 0) { context.SaveChanges(); context.Dispose(); context = new MyDbContext(); context.Configuration.AutoDetectChangesEnabled = false; } } context.SaveChanges(); } scope.Complete(); }</code>
Performance Analysis:
Testing with 560,000 entities (9 scalar properties) yielded these results:
SaveChanges()
(commit count 1): > 20 minutesSaveChanges()
(commit count 1000): 242 secondsSaveChanges()
with Context Disposal (commit count 100): 164 secondsConclusion:
By avoiding frequent SaveChanges()
calls and utilizing batched inserts with context disposal, significant performance gains are achieved in Entity Framework bulk inserts. This optimization minimizes the risk of transaction timeouts and ensures efficient data handling within active transactions.
The above is the detailed content of How Can I Optimize Bulk Inserts in Entity Framework for Improved Performance?. For more information, please follow other related articles on the PHP Chinese website!