S3-FileSystem

Introduction

S3-FileSystem is an implementation of the Hadoop file system contract backed by AWS S3.

For a details on configuration see our usage guide.

Goals

S3-FileSystem was created to enable a more efficient usage of AWS S3. This means:

provide strong read after write consistency (in the meantime AWS has also rolled out native s3 strong consystency).
provide file rename as an atomic O(1) operation. Natively, files cannot be renamed in S3(other file system implementations on top of S3 implement file rename as a copy + delete).
avoid S3 partition hotspot problem regardless of client defined file paths.

Non-Goals

S3-FileSystem does not aim be a drop in replacement for HDFS nor to fully implement the FileSystem specification. There are differences between HDFS and S3-FileSystem, most notably:

S3-FileSystem does not support atomic rename of directories.
S3-FileSystem does not support POSIX like permissions.

For a full list of differences between S3-FileSystem and the Hadoop API specification see our contract definition and our API compatibility analysis.

For the full Hadoop API specification please see these docs. For the implicit assumptions(including atomicity and concurrency) of the API please see these docs.

Similar projects

A few projects that tackle the same issues:

S3 Guard tackles S3 consistency issues:
- Since S3 rolled out native strong consistency, the open source community has decided to deprecate S3 Guard.
S3A committers tackles both consistency and S3's rename problems
- The S3A committers do not attempt to solve these issues at the FileSystem level, but at the OutputCommitter level. Thus, they are primarily targeted at improving Spark/MR job performance and correctness when running on S3.

Contributing

Contributions are welcomed! Read the Contributing Guide for more information.

Licensing

This project is licensed under the Apache V2 License. See LICENSE for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
docs		docs
gradle		gradle
src		src
.gitignore		.gitignore
AUTHORS		AUTHORS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
build.gradle		build.gradle
gradlew		gradlew
gradlew.bat		gradlew.bat
release.sh		release.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

S3-FileSystem

Introduction

Goals

Non-Goals

Similar projects

Contributing

Licensing

About

Releases 1

Packages

Languages

License

adobe/S3-FileSystem

Folders and files

Latest commit

History

Repository files navigation

S3-FileSystem

Introduction

Goals

Non-Goals

Similar projects

Contributing

Licensing

About

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages