Integrating data in SAP BDC with AWS data sources
Non-SAP data from AWS data sources can be harmonized with SAP data via SAP Datasphere data fabric architecture with SAP BDC. The integration architecture supports multiple AWS services, each with specific modes of integration based on live data or replication:
                
                
            A. Integration with Amazon Athena
Mode of Integration: Federating data live into SAP Datasphere
Amazon Athena is Amazon’s interactive query service that helps query and analyze data in S3. Non-SAP data from Athena can be federated live into remote tables in SAP Datasphere and augmented with SAP data for real-time analytics in SAP Analytics Cloud
Here are the steps to integrate Athena with SAP Datasphere:
- 
                  
Prepare source with non-SAP and third party data
 - 
                  
Configure Athena
 - 
                  
onfigure necessary IAM user and authorizations
 - 
                  
Setup SAP Datasphere Connection to Athena
 - 
                  
Build models in SAP Datasphere
 
This enables live data federation without replicating data, thus reduces cost, provides fast insights, and enterprise-grade security. For detailed step by step, visit Federating Queries from SAP Datasphere to Amazon S3 via Amazon Athena
B. Integration with Amazon Redshift
Mode of Integration: Federating data live into SAP Datasphere
Amazon Redshift is a fully managed, petabyte-scale data warehouse service optimized for analytical workloads. Through SAP Datasphere data federation architecture, Redshift data can be augmented with SAP data to build unified data models and analytics in SAP Analytics Cloud. Smart Data Integration (SDI)
Here are the steps to integrate Redshift with SAP Datasphere:
- 
                  
Create On-Premise Agent in SAP Datasphere
 - 
                  
Set Up Redshift Access
 - 
                  
Configure SAP SDI DP Agent
 - 
                  
Register Camel JDBC Adapter in SAP Datasphere
 - 
                  
Upload Third-Party Drivers in SAP Datasphere
 - 
                  
Create Local Connection to Redshift in SAP Datasphere
 - 
                  
Import Remote Tables from Redshift
 
This setup enables live federated queries from SAP Datasphere to Redshift without replicating the data. Benefits include real-time access to Redshift data, pushdown queries for performance optimization, and no data duplication in SAP Datasphere. For detailed step by step, visit Data Federation between SAP Datasphere and Amazon Redshift
C. Integration with Amazon S3
Modes of Integration: Replicating data with Replication Flows, Importing data into SAP Datasphere using Data Flows
Amazon S3 provides object storage service which is highly scalable, durable, available and secure. Non-SAP data from S3 buckets can be imported into SAP Datasphere through the Data Flow feature for use with applications such as Financial Planning or business analytics in SAP Analytics Cloud.
Here are the steps to integrate Amazon S3 with SAP Datasphere:
- 
                  
Prepare source data in an S3 bucket
 - 
                  
Configure necessary IAM user and authorizations
 - 
                  
Create S3 Connection in SAP Datasphere
 - 
                  
Create a Data Flow
 
This process allows SAP Datasphere to connect to S3, access non-sap data, and use that data in combination with internal SAP datasets via Data Flows. For detailed step by step, visit Data integration between SAP Datasphere and in Amazon S3
You can find out more from SAP Architecture Center under Integration with AWS data sources