Just recently, I discovered the awswrangler Python package, which provides functions to easily work with AWS analytics services using pandas data frames. As this package is not referenced in the official documentation and is rarely mentioned in relevant questions on Stack Overflow, you might not stumble upon it even if you're searching for solutions to problems that it covers well.
Besides providing useful functions to read and write data frames from and to S3, Athena, Glue, Redshift and RDS, the maintainers of the package also provide prebuilt packages and layers that can be used in AWS Lambda functions and AWS Glue jobs. For more information, see its GitHub page.
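To give a feel for what this looks like in practice, here is a minimal sketch of writing a data frame to S3 and reading it back, plus running an Athena query. The helper names (`s3_uri`, `round_trip`, `query_athena`) and any bucket, key, and database names are hypothetical and only there for illustration. The sketch assumes `awswrangler` is installed and that AWS credentials are configured in your environment.

```python
def s3_uri(bucket: str, key: str) -> str:
    """Build an s3:// URI from a bucket name and an object key (hypothetical helper)."""
    return f"s3://{bucket}/{key.lstrip('/')}"


def round_trip(df, bucket: str, key: str):
    """Write a pandas data frame to S3 as Parquet, then read it back."""
    # Imported lazily so this module still loads where awswrangler is not installed.
    import awswrangler as wr

    path = s3_uri(bucket, key)
    wr.s3.to_parquet(df=df, path=path)    # upload the data frame as a Parquet object
    return wr.s3.read_parquet(path=path)  # download it again as a data frame


def query_athena(sql: str, database: str):
    """Run a SQL query in Athena and return the result as a pandas data frame."""
    import awswrangler as wr

    return wr.athena.read_sql_query(sql=sql, database=database)
```

With a data frame `df` at hand, a call such as `round_trip(df, "my-bucket", "demo/data.parquet")` should return the same rows that were written, and `query_athena("SELECT 1", "my_database")` should return a one-row data frame. Both the bucket and the database here are placeholders that you would replace with your own.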