Getting Started

The PDF Extract API provides modern cloud-based capabilities for automatically extracting contents from PDF. The API is accessible through SDKs which help you get up and running quickly. Once you've received your developer credential, download and set up one of the sample projects. After you're familiar with the APIs, leverage the samples in your own server-side code.

The SDK only supports server-based use cases where credentials are saved securely in a safe environment. SDK credentials should not be sent to untrusted environments or end user devices.

Step 1 : Getting the access token

PDF Services API endpoints are authenticated endpoints. Getting an access token is a two-step process :

Get Credentials Invoking PDF Services API requires an Adobe-provided credential. To get one, click here, and complete the workflow. Be sure to copy and save the credential values to a secure location.
Retrieve Access Token The PDF Services APIs require an access_token to authorize the request. Use the "Get AccessToken" API from the Postman Collection with your client_id, client_secret (mentioned in the pdfservices-api-credentials.json file downloaded in 1) to get the access_token OR directly use the below mentioned cURL to get the access_token.

REST API

curl --location 'https://pdf-services.adobe.io/token' \
--header 'Content-Type: application/x-www-form-urlencoded' \
--data-urlencode 'client_id={{Placeholder for Client ID}}' \
--data-urlencode 'client_secret={{Placeholder for Client Secret}}'
Copied to your clipboard
curl --location 'https://pdf-services.adobe.io/token' \
--header 'Content-Type: application/x-www-form-urlencoded' \
--data-urlencode 'client_id={{Placeholder for Client ID}}' \
--data-urlencode 'client_secret={{Placeholder for Client Secret}}'

Step 2 : Uploading an asset

After getting the access token, we need to upload the asset. Uploading an asset is a two-step process :

First you need to get an upload pre-signed URI by using the following API.

You can read more about the API in detail here.

REST API

curl --location --request POST 'https://pdf-services.adobe.io/assets' \
--header 'X-API-Key: {{Placeholder for client_id}}' \
--header 'Authorization: Bearer {{Placeholder for token}}' \
--header 'Content-Type: application/json' \
--data-raw '{
    "mediaType": "{{Placeholder for mediaType}}"
}'
Copied to your clipboard
curl --location --request POST 'https://pdf-services.adobe.io/assets' \
--header 'X-API-Key: {{Placeholder for client_id}}' \
--header 'Authorization: Bearer {{Placeholder for token}}' \
--header 'Content-Type: application/json' \
--data-raw '{
    "mediaType": "{{Placeholder for mediaType}}"
}'

On getting a 200 response status from the above API, use the uploadUri field in the response body of the above API to upload the asset directly to the cloud provider using a PUT API call. You will also get an assetID field which will be used in creating the job.

REST API

curl --location -g --request PUT 'https://dcplatformstorageservice-prod-us-east-1.s3-accelerate.amazonaws.com/b37fd583-1ab6-4f49-99ef-d716180b5de4?X-Amz-Security-Token={{Placeholder for X-Amz-Security-Token}}&X-Amz-Algorithm={{Placeholder for X-Amz-Algorithm}}&X-Amz-Date={{Placeholder for X-Amz-Date}}&X-Amz-SignedHeaders={{Placeholder for X-Amz-SignedHeaders}}&X-Amz-Expires={{Placeholder for X-Amz-Expires}}&X-Amz-Credential={{Placeholder for X-Amz-Credential}}&X-Amz-Signature={{Placeholder for X-Amz-Signature}}' \
--header 'Content-Type: application/pdf' \
--data-binary '@{{Placeholder for file path}}'
Copied to your clipboard
curl --location -g --request PUT 'https://dcplatformstorageservice-prod-us-east-1.s3-accelerate.amazonaws.com/b37fd583-1ab6-4f49-99ef-d716180b5de4?X-Amz-Security-Token={{Placeholder for X-Amz-Security-Token}}&X-Amz-Algorithm={{Placeholder for X-Amz-Algorithm}}&X-Amz-Date={{Placeholder for X-Amz-Date}}&X-Amz-SignedHeaders={{Placeholder for X-Amz-SignedHeaders}}&X-Amz-Expires={{Placeholder for X-Amz-Expires}}&X-Amz-Credential={{Placeholder for X-Amz-Credential}}&X-Amz-Signature={{Placeholder for X-Amz-Signature}}' \
--header 'Content-Type: application/pdf' \
--data-binary '@{{Placeholder for file path}}'

Step 3 : Creating the job

To create a job for the operation, please use the assetID obtained in Step 2 in the API request body. On successful job submission you will get a status code of 201 and a response header location which will be used for polling.

For creating the job, please refer to the corresponding API spec for the particular PDF Operation.

Step 4 : Fetching the status

Once the job is successfully created, you need to poll the at the location returned in response header of Step 3 by using the following API

You can read more about the API in detail here.

REST API

curl --location -g --request GET 'https://pdf-services.adobe.io/operation/compresspdf/{{Placeholder for job id}}/status' \
--header 'Authorization: Bearer {{Placeholder for token}}' \
--header 'x-api-key: {{Placeholder for client id}}'
Copied to your clipboard
curl --location -g --request GET 'https://pdf-services.adobe.io/operation/compresspdf/{{Placeholder for job id}}/status' \
--header 'Authorization: Bearer {{Placeholder for token}}' \
--header 'x-api-key: {{Placeholder for client id}}'

Instead of using the status API to check the job status, you have the option to receive job completion notifications through Webhook. For more information, please refer the documentation.

Step 5 : Downloading the asset

On getting 200 response code from the poll API, you will receive a status field in the response body which can either be in progress, done or failed.

If the status field is in progress you need to keep polling the location until it changes to done or failed.

If the status field is done the response body will also have a download pre-signed URI in the dowloadUri field, which will be used to download the asset directly from cloud provider by making the following API call

You can read more about the API in detail here.

REST API

curl --location -g --request GET 'https://dcplatformstorageservice-prod-us-east-1.s3-accelerate.amazonaws.com/b37fd583-1ab6-4f49-99ef-d716180b5de4?X-Amz-Security-Token={{Placeholder for X-Amz-Security-Token}}&X-Amz-Algorithm={{Placeholder for X-Amz-Algorithm}}&X-Amz-Date={{Placeholder for X-Amz-Date}}&X-Amz-SignedHeaders={{Placeholder for X-Amz-SignedHeaders}}&X-Amz-Expires={{Placeholder for X-Amz-Expires}}&X-Amz-Credential={{Placeholder for X-Amz-Credential}}&X-Amz-Signature={{Placeholder for X-Amz-Signature}}'
Copied to your clipboard
curl --location -g --request GET 'https://dcplatformstorageservice-prod-us-east-1.s3-accelerate.amazonaws.com/b37fd583-1ab6-4f49-99ef-d716180b5de4?X-Amz-Security-Token={{Placeholder for X-Amz-Security-Token}}&X-Amz-Algorithm={{Placeholder for X-Amz-Algorithm}}&X-Amz-Date={{Placeholder for X-Amz-Date}}&X-Amz-SignedHeaders={{Placeholder for X-Amz-SignedHeaders}}&X-Amz-Expires={{Placeholder for X-Amz-Expires}}&X-Amz-Credential={{Placeholder for X-Amz-Credential}}&X-Amz-Signature={{Placeholder for X-Amz-Signature}}'

There you go! Your job is completed in 5 simple steps.

SDK

PDF Services API is also accessible via SDKs in popular languages such as Java, .NET, Node JS and Python.

Please allow-list the following hostnames before using Adobe PDF Services SDK:

ims-na1.adobelogin.com (Required for all the clients)

For clients using SDK version 4.x and above :

Using United States (Default) region for processing documents :
- dcplatformstorageservice-prod-us-east-1.s3-accelerate.amazonaws.com (Not required, if using external storage for both input and output)
- pdf-services-ue1.adobe.io
- pdf-services.adobe.io (Default URI)

Using Europe region for processing documents :
- dcplatformstorageservice-prod-eu-west-1.s3.amazonaws.com (Not required, if using external storage for both input and output)
- pdf-services-ew1.adobe.io

For clients using SDK version 3.x and above :

Using United States region for processing documents :
- dcplatformstorageservice-prod-us-east-1.s3-accelerate.amazonaws.com
- pdf-services-ue1.adobe.io
- pdf-services.adobe.io (Default URI)

Using Europe region for processing documents :
- dcplatformstorageservice-prod-eu-west-1.s3.amazonaws.com
- pdf-services-ew1.adobe.io

For clients using SDK version upto 2.x :

cpf-ue1.adobe.io

Java

Jump start your development by bookmarking or downloading the following key resources:

This document
API reference (Javadoc)
Java Sample code
Java library. The Maven project contains the .jar file.

Authentication

Once you complete the Getting Credentials, a zip or json file automatically downloads that contains content whose structure varies based on whether you opted to download personalized code samples.

Personalized Download: Downloads the zip which contains adobe-dc-pdf-services-sdk-java-samples with a preconfigured pdfservices-api-credentials.json file.
Non Personalized Download: Downloads the pdfservices-api-credentials.json with your preconfigured credentials.

After downloading the zip, you can run the samples in the zip directly by setting up the two environment variables PDF_SERVICES_CLIENT_ID and PDF_SERVICES_CLIENT_SECRET by running the following cammands :

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Example pdfservices-api-credentials.json file

{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}
Copied to your clipboard
{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}

Setup a Java environment

Install Java 11 or above.
Run javac -version to verify your install.
Verify the JDK bin folder is included in the PATH variable (method varies by OS).
Install Maven. You may use your preferred tool; for example:
- Windows: Example: Chocolatey.
- Macintosh: Example: brew install maven.

Maven uses pom.xml to fetch pdfservices-sdk from the public Maven repository when running the project. The .jar automatically downloads when you build the sample project. Alternatively, you can download the pdfservices-sdk.jar file, and configure your own environment.

Running the samples

The quickest way to get up and running is to download the code samples during the Getting Credentials workflow. These samples provide everything from ready-to-run sample code, an embedded credential json file, and pre-configured connections to dependencies.

Download the Java sample project.
Build the sample project with Maven: mvn clean install.
Set the environment variables PDF_SERVICES_CLIENT_ID and CLIENT_SECRET by running the following commands :

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Test the sample code on the command line.
Refer to this document for details about running samples as well as the API Reference for API details.

Command line execution is not mandatory. You can import the samples Maven project into your preferred IDE (e.g. IntelliJ/Eclipse) and run the samples from there.

Verifying download authenticity

For security reasons you may wish to confirm the installer's authenticity. To do so,

After installing the package, navigate to the .jar.sha1 file.
Calculate the hash with any 3rd party utility.
Find and open PDF Services sha1 file. Note: if you're using Maven, look in the .m2 directory.
Verify the hash you generated matches the value in the .sha1 file.

5d49322ced35a8195378a7ed297b4fb721780736
Copied to your clipboard
5d49322ced35a8195378a7ed297b4fb721780736

Logging

Refer to the API docs for error and exception details.

For logging, use the slf4j API with a log4js-slf4j binding.
Logging configurations are provided in src/main/resources/log4js.properties.
Specify alternate bindings, if required, in pom.xml.

log4js.properties file

name=PropertiesConfig
appenders = console

# A sample console appender configuration, Clients can change as per their logging implementation
rootLogger.level = WARN
rootLogger.appenderRefs = stdout
rootLogger.appenderRef.stdout.ref = STDOUT

appender.console.type = Console
appender.console.name = STDOUT
appender.console.layout.type = PatternLayout
appender.console.layout.pattern = [%-5level] %d{yyyy-MM-dd HH:mm:ss.SSS} [%t] %c{1} - %msg%n

loggers = pdfservicessdk,validator,apache

# Change the logging levels as per need. INFO is recommended for pdfservices-sdk
logger.pdfservicessdk.name = com.adobe.pdfservices.operation
logger.pdfservicessdk.level = INFO
logger.pdfservicessdk.additivity = false
logger.pdfservicessdk.appenderRef.console.ref = STDOUT

logger.validator.name=org.hibernate
logger.validator.level=WARN

logger.apache.name=org.apache
logger.apache.level=WARN
Copied to your clipboard
name=PropertiesConfig
appenders = console

# A sample console appender configuration, Clients can change as per their logging implementation
rootLogger.level = WARN
rootLogger.appenderRefs = stdout
rootLogger.appenderRef.stdout.ref = STDOUT

appender.console.type = Console
appender.console.name = STDOUT
appender.console.layout.type = PatternLayout
appender.console.layout.pattern = [%-5level] %d{yyyy-MM-dd HH:mm:ss.SSS} [%t] %c{1} - %msg%n

loggers = pdfservicessdk,validator,apache

# Change the logging levels as per need. INFO is recommended for pdfservices-sdk
logger.pdfservicessdk.name = com.adobe.pdfservices.operation
logger.pdfservicessdk.level = INFO
logger.pdfservicessdk.additivity = false
logger.pdfservicessdk.appenderRef.console.ref = STDOUT

logger.validator.name=org.hibernate
logger.validator.level=WARN

logger.apache.name=org.apache
logger.apache.level=WARN

Samples directory structure Java

Custom projects

While the samples use Maven, you can use your own tools and process.

To build a custom project:

Access the .jar in the central Maven repository.
Use your preferred dependency management tool (Ivy, Gradle, Maven), to include the SDK .jar dependency.
Open the pdfservices-api-credentials.json downloaded when you created your credential.
Add the Authentication details as described above.

Adobe PDF Services SDK On Maven

.NET

Jumpstart your development by bookmarking or downloading the following key resources:

This document
Nuget package
.NET API reference
.NET Sample code
Input/output test files reside in the their respective sample directories

Prerequisites

The samples project requires the following:

NET: version 8.0 or above
A build Tool: Either Visual Studio or .NET Core CLI.

Authentication

Once you complete the Getting Credentials, a zip or json file automatically downloads that contains content whose structure varies based on whether you opted to download personalized code samples.

Personalized Download: Downloads the zip which contains adobe-DC.PDFServices.SDK.NET.Samples with a preconfigured pdfservices-api-credentials.json file.
Non Personalized Download: Downloads the pdfservices-api-credentials.json with your preconfigured credentials.

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Example pdfservices-api-credentials.json file

{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}
Copied to your clipboard
{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}

Set up a NET environment

Running any sample or custom code requires the following:

Download and install the .NET SDK.

The Nuget package automatically downloads when you build the sample project.

Running the samples

Clone or download the samples project.
From the samples directory, build the sample project: dotnet build.
Set the environment variables PDF_SERVICES_CLIENT_ID and PDF_SERVICES_CLIENT_SECRET by running the following commands :

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Test the sample code on the command line.
Refer to this document for details about running samples as well as the API Reference for API details.

Verifying download authenticity

For security reasons you may wish to confirm the installer's authenticity. To do so,

After installing the Nuget package, navigate to the .nuget directory.
Find and open the .sha512 file.
Verify the hash in the downloaded file matches the value published here.

PAJ0eB+wUhgvLcIcH37IUgfFV7TxG2wWhcfaON57hWmpDFiPqAM6kgKFfyzAWVAnkzAQoQEWLPxgSIASNy5d8A==
Copied to your clipboard
PAJ0eB+wUhgvLcIcH37IUgfFV7TxG2wWhcfaON57hWmpDFiPqAM6kgKFfyzAWVAnkzAQoQEWLPxgSIASNy5d8A==

Logging

Refer to the API docs for error and exception details.

The .NET SDK uses LibLog as a bridge between different logging frameworks. Log4net is used as a logging provider in the sample projects and the logging configurations are provided in log4net.config. Add the configuration for your preferred provider and set up the necessary appender as required to enable logging.

log4net.config file

<log4net>
 <root>
   <level value="INFO" />
   <appender-ref ref="console" />
 </root>
 <appender name="console" type="log4net.Appender.ConsoleAppender">
   <layout type="log4net.Layout.PatternLayout">
     <conversionPattern value="%date %level %logger - %message%newline" />
   </layout>
 </appender>
</log4net>
Copied to your clipboard
<log4net>
 <root>
   <level value="INFO" />
   <appender-ref ref="console" />
 </root>
 <appender name="console" type="log4net.Appender.ConsoleAppender">
   <layout type="log4net.Layout.PatternLayout">
     <conversionPattern value="%date %level %logger - %message%newline" />
   </layout>
 </appender>
</log4net>

Custom projects

While building the sample project automatically downloads the Nuget package, you can do it manually if you wish to use your own tools and process.

Go to https://www.adobe.com/go/pdftoolsapi_net_nuget.
Download the latest package.

Adobe PDF Services SDK on Nuget

Node.js

Jumpstart your development by bookmarking or downloading the following key resources:

Authentication

Once you complete the Getting Credentials, a zip or json file automatically downloads that contains content whose structure varies based on whether you opted to download personalized code samples.

Personalized Download: Downloads the zip which contains adobe-dc-pdf-services-sdk-java-samples with a preconfigured pdfservices-api-credentials.json file.
Non Personalized Download: Downloads the pdfservices-api-credentials.json with your preconfigured credentials.

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Example pdfservices-api-credentials.json file

{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}
Copied to your clipboard
{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}

Set up a Node.js environment

Running any sample or custom code requires the following steps:

Install Node.js 18.0 or higher.

The @adobe/pdfservices-node-sdk npm package automatically downloads when you build the sample project.

npm install --save @adobe/pdfservices-node-sdk
Copied to your clipboard
npm install --save @adobe/pdfservices-node-sdk

Running the samples

Download the Node.js sample project .
From the samples root directory, run npm install.
Set the environment variables PDF_SERVICES_CLIENT_ID and PDF_SERVICES_CLIENT_SECRET by running the following commands :

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Test the sample code on the command line.
Refer to this document for details about running samples as well as the API Reference for API details.

Verifying download authenticity

For security reasons you may wish to confirm the installer's authenticity. To do so,

After installing the package, find and open package.json.
Find the "_integrity" key.
Verify the hash in the downloaded file matches the value published here.

sha512-ZvwGfMlGa0mGK5HpPAdTTlsaeKKPLEbVfAeLsHr7YAQ6NO7cVkbxaPqV3Ng8dpq4mNj5Ah0Y1Q80uTjLb0Qf+A==
Copied to your clipboard
sha512-ZvwGfMlGa0mGK5HpPAdTTlsaeKKPLEbVfAeLsHr7YAQ6NO7cVkbxaPqV3Ng8dpq4mNj5Ah0Y1Q80uTjLb0Qf+A==

Logging

Refer to the API docs for error and exception details.

The SDK uses the log4js API for logging. During execution, the SDK searches for config/pdfservices-sdk-log4js-config.json in the working directory and reads the logging properties from there. If you do not provide a configuration file, the default logging logs INFO to the console. Customize the logging settings as needed.

log4js.properties file

{
  "appenders": {
    "consoleAppender": {
      "_comment": "A sample console appender configuration, Clients can change as per their logging implementation",
      "type": "console",
      "layout": {
        "type": "pattern",
        "pattern": "%d:[%p]: %m"
      }
    }
  },
  "categories": {
    "default": {
      "appenders": ["consoleAppender"],
      "_comment": "Change the logging levels as per need. info is recommended for pdfservices-node-sdk",
      "level": "info"
    }
  }
} 
Copied to your clipboard
{
  "appenders": {
    "consoleAppender": {
      "_comment": "A sample console appender configuration, Clients can change as per their logging implementation",
      "type": "console",
      "layout": {
        "type": "pattern",
        "pattern": "%d:[%p]: %m"
      }
    }
  },
  "categories": {
    "default": {
      "appenders": ["consoleAppender"],
      "_comment": "Change the logging levels as per need. info is recommended for pdfservices-node-sdk",
      "level": "info"
    }
  }
}

Custom projects

While building the sample project automatically downloads the Node package, you can do it manually if you wish to use your own tools and process.

Go to https://www.npmjs.com/package/@adobe/pdfservices-node-sdk
Download the latest package.

Adobe PDF Services SDK on NPM JS

Python

Jump start your development by bookmarking or downloading the following key resources:

Authentication

Once you complete the Getting Credentials, a zip or json file automatically downloads that contains content whose structure varies based on whether you opted to download personalized code samples.

Personalized Download: Downloads the zip which contains adobe-dc-pdf-services-sdk-java-samples with a preconfigured pdfservices-api-credentials.json file.
Non Personalized Download: Downloads the pdfservices-api-credentials.json with your preconfigured credentials.

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Example pdfservices-api-credentials.json file

{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}
Copied to your clipboard
{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}

Setup a Python environment

Running any sample or custom code requires the following steps:

Install Python 3.10 or higher.
Verify your installation by running this command: python --version.

pip uses requirements.txt file to fetch pdfservices-sdk from the public PyPi repository when running the project. The .whl automatically downloads when you build the sample project. Alternatively, you can download the pdfservices-sdk .whl file, and configure your own environment.

Running the samples

Download and extract the Python sample project.
Copy the downloaded zip to the directory that you set up for this project and unzip the files there.
Set the environment variables PDF_SERVICES_CLIENT_ID and CLIENT_SECRET by running the following commands :

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Go to the project directory (which contains requirements.txt file) and setup a virtual environment.

pip install virtualenv
python -m venv myenv
source myenv/bin/activate # on windows, use `myenv\Scripts\activate`
Copied to your clipboard
pip install virtualenv
python -m venv myenv
source myenv/bin/activate # on windows, use `myenv\Scripts\activate`

Now build the sample project using this command in terminal: pip install -r requirements.txt.
Test the sample code on the command line.
Refer to this document for details about running samples as well as the API Reference for API details.
You can import the samples into your preferred IDE and run the samples from there or run the commands from the README.md file in terminal.

Verifying download authenticity

For security reasons you may wish to confirm the installer's authenticity. To do so,

After downloading the package zip, run following command

pip hash <download_dir>/pdfservices-sdk-4.2.0.tar.gz
Copied to your clipboard
pip hash <download_dir>/pdfservices-sdk-4.2.0.tar.gz

Above command will return the hash of downloaded package.
Verify the hash matches the value published here.

378afa7d3b22683264a1817393932a649b98e4d4b3913816f839de240f7132d5
Copied to your clipboard
378afa7d3b22683264a1817393932a649b98e4d4b3913816f839de240f7132d5

To extract data from the sample PDF file

python src/extractpdf/extract_text_table_info_with_renditions_from_pdf.py
Copied to your clipboard
python src/extractpdf/extract_text_table_info_with_renditions_from_pdf.py

Note: The above commands run on the input file “extractPdfInput.pdf” present in “src/main/resources” directory and generate result in “output” directory inside the project. If the output files already exist, the commands will report an error.

Other sample programs in the SDK let you try out API options including generating a tagging report, and shifting headings so that the document title is only H1.

Public API

PDF Services API is accessible directly via REST APIs which requires Adobe-provided credential for authentication. Once you've completed the Getting Credentials workflow, a zip file automatically downloads that contains content whose structure varies based on whether you opted to download personalized code samples. The zip file structures are as follows:

Personalized Download: Downloads the zip which contains adobe-dc-pdf-services-sdk-java-samples with a preconfigured pdfservices-api-credentials.json file.
Non Personalized Download: Downloads the pdfservices-api-credentials.json with your preconfigured credentials.

Windows:
- set PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- set PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>
MacOS/Linux:
- export PDF_SERVICES_CLIENT_ID=<YOUR CLIENT ID>
- export PDF_SERVICES_CLIENT_SECRET=<YOUR CLIENT SECRET>

Example pdfservices-api-credentials.json file

{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}
Copied to your clipboard
{
    "client_credentials": {
        "client_id": "<YOUR_CLIENT_ID>",
        "client_secret": "<YOUR_CLIENT_SECRET>"
    },
    "service_principal_credentials": {
       "organization_id": "<YOUR_ORGNIZATION_ID>"
    }
}