Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

This documentation focuses on access to s3 using Python libraries, specifically on the use of boto3 library, and includes a code sample, split into various steps.

Table of Contents

Step 0: Install package in Python environment

In order to run the following example you need to have the following packages in your environment:

...

Now that you have an environment with the required packages and Python version, we can move to prepare and run the actual code.

Step 1: Configure client

Depending on whether the bucket you want to access is private (needs credentials) or public (no credentials needed), choose one of the following..

To access private buckets that require S3 credentials

Make sure you have both Python 3 and the access keys to your S3 bucket ready. Typically you'll find your access key and secret key in Morpheus under Tools -> Cypher. 

...

Code Block
languagepy
import os
import io
import boto3


# Initializing variables for the client 
S3_BUCKET_NAME = "MyFancyBucket123'"  #Fill this in  
S3_ACCESS_KEY = "123asdf"  #Fill this in  
S3_SECRET_ACCESS_KEY = "123asdf111"  #Fill this in  
S3_ENDPOINT_URL = "https://my-s3-endpoint.com"  #Fill this in   

...

Code Block
languagepy
# Initialize the S3 client
s3 = boto3.client(
	's3',
	endpoint_url=S3_ENDPOINT_URL,
    aws_access_key_id=S3_ACCESS_KEY,
    aws_secret_access_key=S3_SECRET_ACCESS_KEY
)


To access public buckets (no credentials required)

Code Block
languagepy
import os
import io
import boto3

from botocore import UNSIGNED
from botocore.config import Config   

# Initializing variables for the client  
S3_BUCKET_NAME = "MyFancyBucket123"  #Fill this in  
S3_ENDPOINT_URL = "https://my-s3-endpoint.com"  #Fill this in

...

Code Block
languagepy
# Initialize the S3 client
s3 = boto3.client(
	's3',
	endpoint_url=S3_ENDPOINT_URL,
    config=Config(
		signature_version=UNSIGNED
	))


Step 2: Perform actions

Case 1: List objects

As a first step, and to confirm we have successfully connected, lets list the objects inside our bucket (up to a 1.000 objects). 

...

Code Block
languagejs
{'Key': 'MyFile.txt', 'LastModified': datetime.datetime(2021, 11, 11, 0, 39, 23, 320000, tzinfo=tzlocal()), 'ETag': '"2e22f62675cea3445f7e24818a4f6ba0d6-1"', 'Size': 1013, 'StorageClass': 'STANDARD'}


Case 2: Read objects

Into memory

Now lets try to read a file from a bucket into Python's memory, so we can work with it inside Python without ever saving the file to our local computer:

Code Block
languagepy
#Read a file into Python's memory and open it as a string
FILENAME = '/folder1/folder2/myfile.txt'  #Fill this in  
obj = s3.get_object(Bucket=S3_BUCKET_NAME, Key=FILENAME)
myObject = obj['Body'].read().decode('utf-8') 
print(myObject)

Download objects to a file

But if you'd want to download the file instead of reading it into memory (e.g. so you can use it with other libraries or applications that expect files), here's how you'd do that:

Code Block
languagepy
# Downloading a file from the bucket
with open('myfile', 'wb') as f:  #Fill this in  
    s3.download_fileobj(S3_BUCKET_NAME, 'myfile', f) 


Case 3: Upload objects

And similarly you can upload files to the bucket (given that you have write access to the bucket):

Code Block
languagepy
# Uploading a file to the bucket (make sure you have write access)
response = s3.upload_file('myfile', S3_BUCKET_NAME, 'myfile')  #Fill this in  


Case 4: Create a bucket

And lastly, creating a bucket (this could take some time):

...

  • upload a file into the new bucket (case 3)
  • list the contents of the bucket to verify your file is there (case 1)
  • download the file you uploaded (case 2)

Related examples

Other Resources

...