@wadahiro wadahiro commented Jan 28, 2026

Summary

This PR addresses OutOfMemoryError during CSV export of large datasets and significantly improves export performance.

Background

Before this fix, CSV export of large datasets had several critical issues:

1. OutOfMemoryError

Loading all data into memory caused OOME with large datasets.

2. PostgreSQL IN clause parameter limit

Export of more than 65,535 records was impossible due to PostgreSQL's prepared statement parameter limit:

Caused by: org.postgresql.util.PSQLException: PreparedStatement can have at most 65,535 parameters.
Please consider using arrays, or splitting the query in several ones, or using COPY.
Given query has 91,362 parameters
  at org.postgresql.jdbc.PgPreparedStatement.<init>(PgPreparedStatement.java:107)
  at com.querydsl.sql.AbstractSQLQuery.fetch(AbstractSQLQuery.java:439)
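One common way around this limit (independent of the streaming approach this PR ultimately takes) is to split a large key list into IN-clause chunks that stay under the driver's parameter cap. A minimal sketch of the chunking logic, with a hypothetical `chunk` helper not taken from the midPoint codebase:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: split a large OID list into IN-clause chunks that
// each stay under PostgreSQL's 65,535-parameter limit, so each chunk can
// be bound to its own prepared statement.
public class InClauseChunking {

    static final int MAX_PARAMS = 65_535;

    // Split `oids` into sublists no larger than `chunkSize`.
    static List<List<String>> chunk(List<String> oids, int chunkSize) {
        List<List<String>> chunks = new ArrayList<>();
        for (int i = 0; i < oids.size(); i += chunkSize) {
            chunks.add(oids.subList(i, Math.min(i + chunkSize, oids.size())));
        }
        return chunks;
    }

    public static void main(String[] args) {
        List<String> oids = new ArrayList<>();
        for (int i = 0; i < 91_362; i++) {
            oids.add("oid-" + i);
        }
        // The failing query above had 91,362 parameters: that would split
        // into two chunks of 65,535 and 25,827.
        System.out.println(chunk(oids, MAX_PARAMS).size());
    }
}
```

Chunking keeps each statement valid but still requires materializing all OIDs up front, which is why the PR prefers cursor-based streaming instead.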

3. AccessCertificationWorkItem export performance issues

Even after resolving the OOME and IN clause issues, AccessCertificationWorkItem export had severe performance problems. Exporting 5,000 WorkItems took over 8 minutes due to multiple N+1 query problems:

Data structure:

Campaign (1)
  └── Case (5000)
        └── WorkItem (500,000 = 5000 cases × 100 reviewers)

N+1 queries per WorkItem:

  1. Campaign fetch - Each WorkItem triggered a separate Campaign query
  2. Case fetch - Each WorkItem triggered a separate Case query
  3. Reference fetch - Each WorkItem triggered a separate reference query
  4. objectRef/targetRef displayName fetch (GUI layer) - Each row triggered loadObject() to resolve display names
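The general shape of the fix for these N+1 patterns is to collect the foreign keys of a whole batch and resolve them with a single lookup. A simplified in-memory sketch of that idea (the `WorkItem`/`Campaign` records and `fetchCampaigns` stand-in are illustrative, not midPoint's actual types):

```java
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.stream.Collectors;

// Hypothetical sketch of batch loading: instead of fetching the Campaign
// for each WorkItem individually (N+1 queries), collect the campaign OIDs
// of a whole batch and resolve them in one lookup.
public class BatchLoadSketch {

    record WorkItem(String campaignOid) {}
    record Campaign(String oid, String name) {}

    // Stand-in for a single IN-clause query returning all requested campaigns.
    static Map<String, Campaign> fetchCampaigns(Set<String> oids) {
        return oids.stream()
                .collect(Collectors.toMap(o -> o, o -> new Campaign(o, "campaign " + o)));
    }

    // One query per batch instead of one query per item.
    static Map<String, Campaign> resolveForBatch(List<WorkItem> batch) {
        Set<String> oids = batch.stream()
                .map(WorkItem::campaignOid)
                .collect(Collectors.toSet());
        return fetchCampaigns(oids);
    }

    public static void main(String[] args) {
        List<WorkItem> batch = List.of(
                new WorkItem("c1"), new WorkItem("c1"), new WorkItem("c2"));
        // 3 work items, but only 2 distinct campaigns to fetch.
        System.out.println(resolveForBatch(batch).size());
    }
}
```

With 500,000 WorkItems in batches of 100, this turns hundreds of thousands of per-item queries into a few thousand batched ones.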

Solution

This fix implements:

  • JDBC cursor-based streaming: Processes rows one by one without loading all OIDs into an IN clause
  • Batch loading with beforeTransformation: Loads Campaign, Case, and references in batches of 100 items using IN clauses
  • displayName caching in ReferenceNameResolver: Caches name/displayName across batches to avoid redundant queries
  • GUI layer optimization: Uses pre-loaded objects from ref.getObject() instead of calling loadObject() for each row
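For reference, cursor-based streaming on the PostgreSQL JDBC driver hinges on two settings: autocommit must be off and a positive fetch size must be set, otherwise the driver buffers the entire result set in memory. A minimal standalone sketch (assumed connection URL and table, not midPoint's actual export code):

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

// Minimal sketch of JDBC cursor-based streaming on PostgreSQL. With
// autocommit disabled and a fetch size set, the driver fetches rows in
// batches behind a server-side cursor instead of materializing the whole
// result set, so memory usage stays constant regardless of row count.
public class StreamingExportSketch {
    public static void main(String[] args) throws SQLException {
        try (Connection conn =
                DriverManager.getConnection("jdbc:postgresql://localhost/midpoint")) {
            conn.setAutoCommit(false);      // required for PostgreSQL cursor use
            try (Statement stmt = conn.createStatement()) {
                stmt.setFetchSize(100);     // stream 100 rows per round trip
                try (ResultSet rs =
                        stmt.executeQuery("SELECT oid, fullobject FROM m_user")) {
                    while (rs.next()) {
                        // write one CSV row per record, then discard it
                    }
                }
            }
        }
    }
}
```

This is the mechanism behind the `iterationPageSize=-1` streaming mode mentioned in the changes below; the sketch requires a live PostgreSQL instance to run.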

Changes

MID-10990: OutOfMemoryError during CSV Export of Large Datasets

  • Fix OutOfMemoryError during CSV export of large datasets: Implement streaming CSV export with IterativeExportSupport and StreamingCsvDataExporter
  • Optimize OperationResult.cleanup() from O(n²) to O(n): Fix performance bottleneck in result cleanup
  • Avoid NoSuchMessageException in LocalizationServiceImpl: Skip unnecessary exception handling for better performance
  • Add JDBC streaming mode support for searchContainersIterative: Enable true JDBC cursor-based streaming with iterationPageSize=-1
  • Add JDBC streaming mode support to searchObjectsIterative: Extend streaming support to Object export
  • Add JDBC streaming support for audit log CSV export: Apply streaming to AuditLogViewer export
  • Use lightweight wrapper for CSV export: Skip expensive child wrapper creation during export
  • Optimize AccessCertificationWorkItem export for large datasets: Implement batch loading with beforeTransformation to eliminate N+1 queries
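The O(n²) → O(n) cleanup item above follows a common pattern worth illustrating: removing matching elements from a list one `remove(element)` call at a time rescans the list for every removal, while a single `removeIf` pass is linear. This is a hypothetical illustration of that pattern, not the actual `OperationResult.cleanup()` code:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of the O(n^2) -> O(n) cleanup pattern.
public class CleanupSketch {

    // Quadratic: each remove(element) scans the list, and it may be
    // called up to n times. (Iterating a copy avoids
    // ConcurrentModificationException.)
    static void cleanupQuadratic(List<String> results) {
        for (String r : new ArrayList<>(results)) {
            if (r.startsWith("minor")) {
                results.remove(r);
            }
        }
    }

    // Linear: one pass over the list, surviving elements shifted once.
    static void cleanupLinear(List<String> results) {
        results.removeIf(r -> r.startsWith("minor"));
    }

    public static void main(String[] args) {
        List<String> results =
                new ArrayList<>(List.of("minor-1", "major-1", "minor-2"));
        cleanupLinear(results);
        System.out.println(results);
    }
}
```

With hundreds of thousands of subresults accumulated during a large export, the difference between the two shapes is what turns cleanup into a visible bottleneck.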

MID-11047: AccessCertificationWorkItem list unstable display order

  • Add default sort order to WorkItem list: Sort by PK order (ownerOid, accessCertCaseCid, cid) for stable display order

MID-11046: CSV export missing .csv extension

  • Fix CSV export filename missing .csv extension: Ensure .csv extension is appended regardless of user input
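The extension fix boils down to normalizing the user-supplied filename. A minimal sketch, assuming a hypothetical `ensureCsvExtension` helper and default filename (neither is taken from the midPoint sources):

```java
// Hypothetical sketch of the filename fix: append ".csv" unless the
// user-supplied name already ends with it (case-insensitively).
public class CsvFilename {

    static String ensureCsvExtension(String name) {
        if (name == null || name.isBlank()) {
            return "export.csv"; // assumed default name
        }
        return name.toLowerCase().endsWith(".csv") ? name : name + ".csv";
    }

    public static void main(String[] args) {
        System.out.println(ensureCsvExtension("users"));     // users.csv
        System.out.println(ensureCsvExtension("users.CSV")); // users.CSV
    }
}
```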

Performance Results

AccessCertificationWorkItem Export

| Condition | Time |
| --- | --- |
| 5,000 WorkItems (5,000 cases × 100 reviewers, filtered by 1 reviewer) | ~27 sec |

User Export (Large Dataset)

| Records | Time | File Size |
| --- | --- | --- |
| ~100,000 users | ~20 sec | ~7 MB |

@wadahiro force-pushed the fix-10990-export-oome branch from 9e53d2e to a58b7e7 on January 29, 2026 at 00:35