Amazon Web Services recently introduced AWS S3 Files, a service that addresses a persistent challenge in cloud computing - how to give file-based applications direct access to object storage without duplicating data or building custom connectors.

The Problem S3 Files Solves

Traditionally, applications designed around file systems faced a difficult choice when working with Amazon S3:

  1. Use object APIs - Build custom integration code and refactor applications
  2. Duplicate data - Copy data between S3 and separate file systems, creating sync challenges and increased costs
  3. Accept performance trade-offs - Work with slower, network-dependent access patterns

S3 Files eliminates these constraints by providing a native file system interface directly over S3 data.

What Is AWS S3 Files?

S3 Files is a shared file system that connects AWS compute resources directly to S3 data. It allows applications to access objects as files and folders using standard file system tools - Python libraries, machine learning frameworks, CLI utilities - without custom APIs or data duplication.

The key innovation: your data remains in S3 while being simultaneously accessible through both file and object interfaces.

Key Features and Capabilities

High Performance Access

S3 Files delivers low latency and up to multiple terabytes per second of aggregate read throughput. It intelligently manages the transition between active and inactive data:

  • Up to 10M+ file system IOPS per bucket
  • Automatic intelligent caching of your working set
  • Automatic expiration of unused data

Massive Concurrency

The service supports 25,000+ concurrent compute resources accessing the same S3 file system simultaneously, enabling coordinated workflows across large distributed systems.

Single Copy Storage

Unlike solutions that require synchronization between storage systems, S3 Files maintains data in one place. Your information persists in S3’s durable, scalable infrastructure while performance storage handles active workloads.

Cost Efficiency

Organizations can achieve up to 90% cost savings compared to traditional approaches of cycling data between S3 and separate file systems, since there’s no duplication and no separate storage tier to maintain.

Ideal Use Cases

Machine Learning Workflows Frameworks like TensorFlow and PyTorch can directly access training data stored in S3 without preprocessing or data duplication.

Data Analysis at Scale Analytics tools and SQL engines can query massive datasets in S3 using familiar file system operations.

Batch Processing Large-scale data processing jobs can work directly with S3 data using standard file interfaces.

Collaborative Computing Multiple compute instances can simultaneously access and process the same dataset in S3.

Legacy Application Migration File system - dependent applications can move to AWS without architectural redesign.

How It Works

S3 Files sits between your compute resources and S3 storage. When your application requests a file:

  1. The request reaches S3 Files
  2. If the data is in the high-performance cache layer, it’s returned immediately
  3. If not, S3 Files fetches it from S3
  4. Inactive data is automatically evicted from the cache to optimize costs
  5. All changes persist to S3

This architecture ensures you get file system performance while maintaining S3’s durability, scalability, and cost benefits.

Getting Started

To use S3 Files, you’ll need:

  • An AWS account with access to the service
  • Compute resources in the same region as your S3 bucket
  • Standard AWS permissions to access your S3 data
  • Applications or frameworks that support file system interfaces

AWS provides documentation and examples for common use cases including machine learning training, data processing, and analytics workloads.

The Broader Impact

S3 Files represents a shift in how cloud storage can be consumed. Rather than forcing applications into a single paradigm - object APIs or file systems - it allows both interfaces to coexist, giving developers more flexibility in how they build and migrate applications.

For organizations with significant investments in file system - based tools and workflows, S3 Files removes a major blocker to cloud adoption. For cloud - native applications, it provides a compelling alternative to maintaining separate file storage infrastructure.

Learn More


Have you tried S3 Files or similar object storage with file system interfaces? Share your experience in the comments below.