Mtg Notes: | Prototyped download to bucket: Option 2: AWS Data Sync Runs on schedule Cost implications/questions/ blob storage time period Requires EC2 configuration, without monitoring and auditing services Within S3--when copy is done, are the same keys within S3 in Azure? No; Metadata is not copied over to blob metadata--not impossible, but would need to create solution
Option 3: Requires EC2 configuration, without monitoring and auditing services Within S3--when copy is done, are the same keys within S3 in Azure? No; Metadata is not copied over to blob metadata--not impossible, but would need to create solution
Option 4: Pros/Cons to all options: Decision Point: Load test/Performance Test: Using meta data Option 1: Aims will need to decide to where to host, but could start that next week Option 4: Needs a bit of refinement, but also viable Option 2,3: Would need additional developer research/support to implement
Uncertain if meta data comes across with Option Data Sync: Scheduling feature, and growth of data would need to be configured/monitored AIMS/DS: Choice needs to be considered from long-term viability and ability to use for other programs/use cases GM: Bucket used would prefix project, meta data used would be mapped to data upload api object, so could be scaled DS: Option 4 is nicer for AIMS, as less code is maintained on AIMS; Polling from CDC; Queue can be monitored, but is available for CDC to poll at any time
DEX: In Option 4, how is Meta Data received? AIMS: Get object command would make meta data available; DEX: Meta Data would be persisted in Azure blob
Leslynn: Are there any programs which APHL would need to pull data back from CDC? Dari: Currently, EIP is the only program which pushes data from CDC, but that would be migrated to CDC Platform intermediate term Leslynn: Total daily volumes--would this be possible to provide per program? AIMS: Preference would be Option 4; Option 1 would need to be coordinated long-term with DEX, so higher LOE
|