# Streaming Examples

S3 upload, multipart upload, progress tracking, parallel tables, and HTTP back-pressure.
## Upload directly to S3
Use the AWS SDK v3 `Upload` helper with a Node.js `Readable` from `toNodeReadable()`. The workbook streams the XLSX directly to S3, so the finished file never needs to be written to disk first.
```ts
import { createExcelSchema, createWorkbookStream } from "@chronicstone/typed-xlsx";
import { S3Client } from "@aws-sdk/client-s3";
import { Upload } from "@aws-sdk/lib-storage";

const schema = createExcelSchema<{ id: string; amount: number }>()
  .column("id", { accessor: "id" })
  .column("amount", { accessor: "amount", style: { numFmt: "#,##0.00" } })
  .build();

const s3 = new S3Client({ region: "us-east-1" });

async function uploadReportToS3(key: string) {
  const workbook = createWorkbookStream({ tempStorage: "file", memoryProfile: "low-memory" });
  const table = await workbook.sheet("Report").table({ id: "data", schema });

  for await (const batch of fetchRowsCursor()) {
    await table.commit({ rows: batch });
  }

  const upload = new Upload({
    client: s3,
    params: {
      Bucket: "my-reports-bucket",
      Key: key,
      Body: workbook.toNodeReadable(),
      ContentType: "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
    },
  });

  await upload.done();
}
```
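The examples on this page assume a `fetchRowsCursor()` helper, which is not part of the library. A minimal sketch, assuming your data source exposes offset/limit pagination — the zero-argument form used in the examples would simply close over its own query:

```ts
// Hypothetical helper assumed by the examples on this page: pages through a
// data source in fixed-size batches so each table.commit() call stays small.
async function* fetchRowsCursor<T>(
  fetchPage: (offset: number, limit: number) => Promise<T[]>,
  batchSize = 1000,
): AsyncGenerator<T[]> {
  let offset = 0;
  while (true) {
    const page = await fetchPage(offset, batchSize);
    if (page.length === 0) return; // cursor exhausted
    yield page;
    offset += page.length;
  }
}
```

Any batch source works here — a database cursor, a paginated API, or an in-memory slice — as long as it eventually yields an empty page.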
## Multipart upload to S3 (manual)
For very large reports where you want explicit control over S3 multipart part sizes:
```ts
import { createExcelSchema, createWorkbookStream } from "@chronicstone/typed-xlsx";
import {
  S3Client,
  CreateMultipartUploadCommand,
  UploadPartCommand,
  CompleteMultipartUploadCommand,
} from "@aws-sdk/client-s3";

const schema = createExcelSchema<{ name: string }>().column("name", { accessor: "name" }).build();
const s3 = new S3Client({ region: "us-east-1" });

async function streamToS3Multipart(key: string) {
  const workbook = createWorkbookStream({ tempStorage: "file" });
  const table = await workbook.sheet("Data").table({ id: "rows", schema });

  for await (const batch of fetchRowsCursor()) {
    await table.commit({ rows: batch });
  }

  const { UploadId } = await s3.send(
    new CreateMultipartUploadCommand({ Bucket: "my-bucket", Key: key }),
  );

  const parts: { ETag: string; PartNumber: number }[] = [];
  let partNumber = 1;
  const PART_SIZE = 5 * 1024 * 1024; // S3 requires >= 5 MB for every part except the last

  const uploadPart = async (body: Buffer) => {
    const { ETag } = await s3.send(
      new UploadPartCommand({
        Bucket: "my-bucket",
        Key: key,
        UploadId,
        PartNumber: partNumber,
        Body: body,
      }),
    );
    parts.push({ ETag: ETag!, PartNumber: partNumber++ });
  };

  // Raw stream chunks are much smaller than 5 MB, so buffer them up to
  // PART_SIZE before uploading each part.
  let pending: Buffer[] = [];
  let pendingSize = 0;
  for await (const chunk of workbook.toNodeReadable()) {
    pending.push(chunk as Buffer);
    pendingSize += (chunk as Buffer).length;
    if (pendingSize >= PART_SIZE) {
      await uploadPart(Buffer.concat(pending));
      pending = [];
      pendingSize = 0;
    }
  }
  if (pendingSize > 0) {
    await uploadPart(Buffer.concat(pending)); // final part may be < 5 MB
  }

  await s3.send(
    new CompleteMultipartUploadCommand({
      Bucket: "my-bucket",
      Key: key,
      UploadId,
      MultipartUpload: { Parts: parts },
    }),
  );
}
```
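Raw chunks from a Node readable are typically tens of kilobytes, far below the 5 MB minimum S3 accepts for any part except the last. A small regrouping helper (my own sketch, not a library API) makes the part-size handling reusable:

```ts
// Regroup an arbitrary chunk stream into parts of at least `minSize` bytes.
// Only the final part may come out smaller.
async function* toParts(
  source: AsyncIterable<Buffer>,
  minSize: number,
): AsyncGenerator<Buffer> {
  let pending: Buffer[] = [];
  let size = 0;
  for await (const chunk of source) {
    pending.push(chunk);
    size += chunk.length;
    if (size >= minSize) {
      yield Buffer.concat(pending);
      pending = [];
      size = 0;
    }
  }
  if (size > 0) yield Buffer.concat(pending); // flush the short final part
}
```

With this helper the part loop reduces to `for await (const part of toParts(workbook.toNodeReadable(), PART_SIZE)) { ... }`, with one `UploadPartCommand` per part.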
## Stream with progress tracking

Wrap `.commit()` calls with a counter to report progress back to a job queue, SSE connection, or log.
```ts
import { createExcelSchema, createWorkbookStream } from "@chronicstone/typed-xlsx";

const schema = createExcelSchema<{ id: string; value: number }>()
  .column("id", { accessor: "id" })
  .column("value", { accessor: "value" })
  .build();

async function exportWithProgress(jobId: string, onProgress: (pct: number) => void) {
  const workbook = createWorkbookStream({ tempStorage: "file" });
  const table = await workbook.sheet("Export").table({ id: "data", schema });
  const totalRows = await getRowCount();

  let committed = 0;
  for await (const batch of fetchRowsCursor()) {
    await table.commit({ rows: batch });
    committed += batch.length;
    onProgress(Math.round((committed / totalRows) * 100));
  }

  await workbook.writeToFile(`./exports/${jobId}.xlsx`);
}
```
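With small batches, `onProgress` fires once per batch and can repeat the same percentage many times. A hedged helper (the name is mine, not the library's) that suppresses duplicate reports before they reach an SSE connection or job queue:

```ts
// Wrap a progress callback so it only fires when the integer percentage
// actually changes, instead of once per committed batch.
function dedupeProgress(onProgress: (pct: number) => void): (pct: number) => void {
  let last = -1;
  return (pct) => {
    if (pct !== last) {
      last = pct;
      onProgress(pct);
    }
  };
}
```

Pass `dedupeProgress(onProgress)` into `exportWithProgress()` in place of the raw callback.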
## Parallel schema preparation, serial commit

Schema building is pure TypeScript with no async I/O. You can build schemas and even create stream tables concurrently, as long as rows are committed to each individual table serially.
```ts
import { createExcelSchema, createWorkbookStream } from "@chronicstone/typed-xlsx";

const ordersSchema = createExcelSchema<{ id: string; total: number }>()
  .column("id", { accessor: "id" })
  .column("total", { accessor: "total" })
  .build();

const returnsSchema = createExcelSchema<{ orderId: string; reason: string }>()
  .column("orderId", { accessor: "orderId" })
  .column("reason", { accessor: "reason" })
  .build();

const workbook = createWorkbookStream();

// Create both tables concurrently: safe, since each lives on its own sheet
const [ordersTable, returnsTable] = await Promise.all([
  workbook.sheet("Orders").table({ id: "orders", schema: ordersSchema }),
  workbook.sheet("Returns").table({ id: "returns", schema: returnsSchema }),
]);

// Run both pipelines concurrently; within each pipeline, rows are still
// committed to their table serially
await Promise.all([
  (async () => {
    for await (const batch of fetchOrders()) {
      await ordersTable.commit({ rows: batch });
    }
  })(),
  (async () => {
    for await (const batch of fetchReturns()) {
      await returnsTable.commit({ rows: batch });
    }
  })(),
]);

await workbook.writeToFile("./report.xlsx");
```
For HTTP response streaming, see the output-target examples in the streaming docs. The underlying pattern is the same: commit rows in batches, then finalize with `pipeToNode()` or `toNodeReadable()` once the workbook is ready.
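The back-pressure guarantee in those examples comes from Node's own stream plumbing rather than anything XLSX-specific. A self-contained sketch (no typed-xlsx involved) of how `pipe()` keeps memory bounded when the consumer, such as a slow HTTP client, cannot keep up:

```ts
import { Readable, Writable } from "node:stream";

// A fast producer piped into a deliberately slow consumer: pipe() pauses the
// source whenever the destination's internal buffer fills, so memory stays
// bounded no matter how slowly the client reads.
const received: string[] = [];

const source = Readable.from(
  (function* () {
    for (let i = 0; i < 5; i++) yield `chunk-${i}`;
  })(),
);

const slowClient = new Writable({
  highWaterMark: 1, // tiny buffer to force back-pressure immediately
  write(chunk, _encoding, callback) {
    received.push(chunk.toString());
    setTimeout(callback, 10); // simulate a slow network
  },
});

source.pipe(slowClient);
slowClient.on("finish", () => console.log(`received ${received.length} chunks`));
```

An HTTP `ServerResponse` is itself a `Writable`, so piping `toNodeReadable()` into it gives you the same behavior for free.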