To best utilize the Import API, we recommend the following best practices.
Effective use of Import Containers
Organize Import Containers
- When importing full data sets, creating a new dedicated Import Container for the task can help in distinguishing imports performed on different occasions.
- When performing routine data imports, reusing a dedicated Import Container may be better.
- When importing data from multiple sources, using an Import Container for each source can help in organizing and monitoring progress.
- When performing high-volume, temporary imports, using ephemeral Import Containers BETA can help manage resources effectively.
Use Case | Possible Import Container organization |
---|---|
Import Product and Category | Create separate Import Containers for Product and Category |
Import Price changes daily at 5 PM | Create a reusable Import Container. If there are more than 200,000 imports per day, split the excess across additional Import Containers based on other business logic, or use a temporary Import Container for the overflow. |
Import Product changes from multiple sources | Create one Import Container per source for Product imports. |
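One way to apply these conventions is to derive the Import Container key from the data source and the resource type, adding a date for one-off full imports so each occasion stays distinguishable. A minimal Python sketch; the naming scheme itself is illustrative, not an Import API requirement:

```python
from datetime import date

def container_key(source, resource_type, day=None):
    """Build an Import Container key per data source and resource type.
    Pass a date for one-off full imports so each occasion stays distinct."""
    parts = [source, resource_type]
    if day is not None:
        parts.append(day.isoformat())
    return "-".join(parts).lower()

# Reusable container for routine Price imports from an assumed "erp" source:
routine = container_key("erp", "price")
# Dedicated container for a full Product import performed on a given day:
full_import = container_key("erp", "product", date(2024, 1, 2))
```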
Optimize performance
We recommend keeping a maximum of 200,000 Import Operations per Import Container. This way, monitoring activities at the container level will not be costly. Because Import Operations are automatically deleted 48 hours after they are created, you can reuse Import Containers over time. An example schedule is as follows:
Day | Import Operation total count | Import Containers (Import Operation count) |
---|---|---|
Day 1 | 100,000 | container-a (100,000) |
Day 2 | 500,000 | container-a (100,000), container-b (200,000), container-c (200,000) |
Day 3 | 400,000 | container-a (0), container-b (200,000), container-c (200,000) |
Day 4 | 200,000 | container-a (200,000), container-b (0), container-c (0) |
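The arithmetic behind this schedule can be captured in a few lines. A minimal Python sketch, assuming the 200,000-operation guideline and the 48-hour automatic deletion of Import Operations; the data structures are illustrative, not part of the API:

```python
from datetime import datetime, timedelta

MAX_OPERATIONS = 200_000             # recommended ceiling per Import Container
OPERATION_TTL = timedelta(hours=48)  # Import Operations are deleted after 48 hours

def live_count(batches, now):
    """Count Import Operations in one container that are not yet auto-deleted.
    `batches` is a list of (created_at, count) pairs."""
    return sum(count for created_at, count in batches
               if now - created_at < OPERATION_TTL)

def has_room(batches, new_ops, now):
    """True if `new_ops` more operations would fit under the recommended cap."""
    return live_count(batches, now) + new_ops <= MAX_OPERATIONS

# container-a received 100,000 operations on Day 1.
day1 = datetime(2024, 1, 1, 9, 0)
container_a = [(day1, 100_000)]

day2 = day1 + timedelta(days=1)  # operations still live, cap partly used
day3 = day1 + timedelta(days=2)  # 48 hours later: deleted, container reusable
```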
On Day 3, the Import Operations created on Day 1 have been automatically deleted because they are older than 48 hours, so container-a is ready to be reused. Similarly, container-b and container-c can be reused from Day 4.
Clean up data from Import Containers and remove unused Import Containers
Import Operations are automatically deleted 48 hours after they are created. To keep monitoring simple and inexpensive, we also recommend removing Import Containers that are no longer used.
Automatically delete Import Containers BETA
To delete Import Containers automatically, set the retentionPolicy in the ImportContainerDraft when creating the ImportContainer. The Import API calculates the expiresAt field based on your defined timeToLive value. Once the expiresAt datetime is reached, the ImportContainer and its Import Operations are permanently deleted. This setting is especially useful for workflows that generate many temporary Import Containers which you do not intend to reuse. The lowest possible timeToLive value is 1 hour (1h) and the highest is 30 days (30d).
Import large data sets
You can send a maximum of 20 resources per Import Request. If you have a huge number of resources to import, we recommend sending Import Requests from multiple threads so that your data reaches the Import Container as fast as possible.
Rate limits
The Import API does not have rate limits, but to ensure the best performance we recommend sending a maximum of 300 API calls per second, per Project, to the Import API.
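A client can enforce both limits itself. A minimal Python sketch that splits resources into Import-Request-sized batches of at most 20 items and paces calls toward the 300-per-second guideline; `send_request` is a caller-supplied stub, not an Import API function:

```python
import time

MAX_ITEMS_PER_REQUEST = 20   # Import Request resource limit
MAX_CALLS_PER_SECOND = 300   # recommended per-Project call rate

def make_batches(resources, size=MAX_ITEMS_PER_REQUEST):
    """Split resources into Import-Request-sized chunks."""
    return [resources[i:i + size] for i in range(0, len(resources), size)]

def paced_send(resources, send_request, sleep=time.sleep):
    """Send all batches, pausing between calls to stay near the
    recommended rate. `send_request` is a hypothetical stub that
    would POST one batch to the Import API. Returns the call count."""
    interval = 1.0 / MAX_CALLS_PER_SECOND
    sent = 0
    for chunk in make_batches(resources):
        send_request(chunk)   # e.g. POST the batch to an Import Container
        sent += 1
        sleep(interval)       # crude pacing; a token bucket is more precise
    return sent
```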
As each API call can contain up to 20 CategoryImport items, this means that up to 360,000 Categories can be sent to the Import API every minute.
Choose the right Product import endpoint
The Import API has multiple endpoints for importing Product data. The following table summarizes what can be imported by each endpoint.
Endpoint | What can be imported |
---|---|
Product Drafts | Product data including Product Variants and Prices. |
Products | Product data without Product Variants and Prices. |
Product Variants | Product Variant data without Prices. |
Product Variant Patches | Product Variant Attribute data. |
Embedded Prices | Price data for a specific Product Variant. |
The following information explains common use cases for each endpoint.
Product Drafts
Common use cases:
- when you want to update a Product with 10+ new Product Variants, each with 20+ Embedded Prices.
- when you want to update a large number of Product Variants with new Attribute values and Prices.
Products
Common use cases:
- when you want to update core Product data without modifying Product Variants or Prices.
Product Variants
Common use cases:
- when you want to create or update Product Variants without modifying their Prices.
Product Variant Patches
Common use cases:
- when you want to update Attribute values of existing Product Variants.
Embedded Prices
Common use cases:
- when you want to update the Price data of a specific Product Variant.
Manage published state of Products
Import resources for Product data contain the field publish, which accepts a boolean value. The result of using this field varies based on whether you are importing data to create or update Products.
When importing data to create a Product
If publish is true, the Product is created and published immediately to the current projection. If false, the Product is created but not published.
When importing data to update an existing Product
The result of updating existing Products depends on whether your import request includes changes and whether the Product currently has staged changes.
Value of publish | Does the import request have changes? | Does the Product have staged changes before importing? | Result |
---|---|---|---|
true | No | Yes | The staged changes are applied to the current projection and the Product is published. |
true | No | No | If the Product is currently unpublished, it is published to the current projection. |
true | Yes | N/A | The changes are applied to both the current and staged projections. |
false | No | N/A | The Product is unpublished. |
false | Yes | N/A | The changes are applied to the staged projection, the Product is unpublished, and hasStagedChanges becomes true. |
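The table can be read as a small decision function. A minimal Python sketch that returns the documented outcome for each combination; the outcome strings paraphrase the table and are not API values:

```python
def update_outcome(publish, request_has_changes, had_staged_changes):
    """Return the documented result of an update import (see table above)."""
    if publish:
        if request_has_changes:
            return "changes applied to current and staged projections"
        if had_staged_changes:
            return "staged changes applied; Product published"
        return "published if currently unpublished"
    if request_has_changes:
        return ("changes applied to staged projection; "
                "Product unpublished; hasStagedChanges=true")
    return "Product unpublished"
```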
Utilize the lifetime of Import Operations
Import Operations have a lifetime of 48 hours, which allows you to send other resources, including resources referenced by them (unresolved references), during this period. For example, if an imported resource references a Product that does not exist yet, its Import Operation stays unresolved for up to 48 hours and waits for the Product to be imported.
Handle retries
Retry an import request only if its Import Operation has the rejected status. In other cases, the Import API will handle the retry internally without you needing to do anything.
What not to do
Do not send duplicate import requests concurrently. Since the Import API imports data asynchronously, the order is not guaranteed. It may also lead to a concurrent modification error.
Before retrying a failed import request, check the cause of the failure in the Import Operation's errors field.
Avoid concurrency errors
Setting the product field in the ProductVariantPatch minimizes concurrency errors during the import process. If you set the product field on one ProductVariantPatch, you must set it for every ProductVariantPatch in the same ProductVariantPatchRequest. Otherwise, the API returns an InvalidField error.
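A simple client-side check can catch the all-or-none rule before the request is sent. A minimal Python sketch in which each ProductVariantPatch is represented as a plain dict for illustration:

```python
def product_field_consistent(patches):
    """True if the `product` field is set on every ProductVariantPatch
    in the request, or on none of them; mixed usage yields InvalidField."""
    with_product = sum(1 for patch in patches if "product" in patch)
    return with_product in (0, len(patches))

# All patches reference a product: valid request.
ok = product_field_consistent([{"product": "p1"}, {"product": "p2"}])
# Only some patches set the field: the API would reject this request.
bad = product_field_consistent([{"product": "p1"}, {}])
```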