Global Entity Matcher (GEM)

API workflow

Request access

API workflow overview

A typical API-based workflow follows these steps:

  1. Authentication

  2. Storage Access

    • List available storages (GET /storages)
    • Get storage credentials (GET /storages/{id}/credentials)
    • Finish Storage Authorization step to be authorized to access the storage with the obtained credentials
    • Make sure your data has the correct format, see Data preparation guide
    • Upload data using Azure CLI (replace placeholders with actual values):
      az storage blob upload --account-name "<STORAGE_ACCOUNT_NAME>" \
      --container-name "default" \
      --name "<YOUR_FILE_NAME>.parquet" \
      --file "/path/to/<YOUR_FILE_NAME>.parquet" \
      --auth-mode login
  3. Job Execution

  4. Results Download

    • Check output file schema in Output data schema
    • Download results using Azure CLI with storage credentials (replace placeholders with actual values):
      az storage blob download --account-name "<STORAGE_ACCOUNT_NAME>" \
      --container-name "default" \
      --name "<YOUR_FILE_NAME>.results.parquet" \
      --file "/path/to/<YOUR_FILE_NAME>.results.parquet" \
      --auth-mode login

Resources