Connect a SharePoint data source
After you set up authentication and store your credentials in an AWS Secrets Manager secret, create the SharePoint data source in your knowledge base. This page describes how to create the data source with the AWS Management Console or the API, followed by a reference for the connector parameters you can configure.
Note
Complete authentication setup first. See Set up Microsoft Entra ID App-Only authentication for SharePoint (recommended) or Set up OAuth 2.0 authentication for SharePoint. You need the secret ARN, your tenant ID, and (for Microsoft Entra ID App-Only authentication) the Amazon S3 location of your certificate.
Create the data source
Connector parameters
The data source configuration uses the following connector parameters. To connect to SharePoint, specify SHAREPOINT as the connector type in
connectorParameters. For the
fields that wrap connectorParameters (such as deletionProtectionConfiguration and
mediaExtractionConfiguration), see Connect a data source.
| Field | Required | Description |
|---|---|---|
secretArn |
Yes | The ARN of the AWS Secrets Manager secret containing your SharePoint credentials. |
tenantId |
Yes | The Microsoft Entra (Azure AD) tenant ID. |
authType |
Yes | The authentication type: ENTRA_ID_APP_ONLY (recommended) or OAUTH2_APP. See Authentication methods. |
certificateS3Path |
Conditional | Required for ENTRA_ID_APP_ONLY authentication, even when ACLs are disabled. Contains s3BucketName and s3KeyName for the certificate file in Amazon S3. Not used with OAUTH2_APP. |
| Field | Required | Description |
|---|---|---|
siteUrls |
Yes | List of SharePoint site URLs to crawl. Each URL is a crawl entry point and must start with https:// and point to a site, team site, or personal site — the path must begin with /sites/, /teams/, or /personal/ followed by the site name (for example, https://contoso.sharepoint.com/sites/engineering). Standard *.sharepoint.com domains and custom (vanity) domains are both supported. Within each site, the connector crawls files and pages (see crawlFiles and crawlPages); to target specific items, use inclusionItemPaths. |
crawlFiles |
No | Whether to crawl files. Defaults to true. |
crawlPages |
No | Whether to crawl pages. Defaults to true. |
| Field | Required | Description |
|---|---|---|
modifiedDateBefore |
No | Only include content modified before this date (ISO 8601). |
modifiedDateAfter |
No | Only include content modified after this date (ISO 8601). |
inclusionItemPaths |
No | Specific item paths to include. When you provide inclusionItemPaths, the connector crawls those items and does not validate siteUrls. |
| Field | Required | Description |
|---|---|---|
aclEnabled |
No | Set to true to enable document-level access control. Requires ENTRA_ID_APP_ONLY authentication. You cannot change this setting after you create the data source. For details, see Document-level access controls. |
Change the authentication method
You can change a data source's authentication method (for example, from OAuth 2.0 to Microsoft Entra ID App-Only) by updating the data source with the new authType and credentials, using the UpdateDataSource operation or the AWS Management Console. The document-level access control setting is fixed when you create a data source, so to add or remove ACLs you must create a new data source.
Next steps
After you create the data source, sync it to ingest content into your knowledge base. For details, see Sync a data source. To filter query results by user permissions, see Document-level access controls.