Skip to content

Latest commit

 

History

History
193 lines (130 loc) · 17.2 KB

CONTRIBUTING.md

File metadata and controls

193 lines (130 loc) · 17.2 KB

Contributing to OpenSearch

OpenSearch is a community project built and maintained by people just like you. We're glad you're interested in helping out. There are several different ways you can do it, but before we talk about that, let's talk about how to get started.

First Things First

  1. When in doubt, open an issue - For almost any type of contribution, the first step is opening an issue. Even if you think you already know what the solution is, writing down a description of the problem you're trying to solve will help everyone get context when they review your pull request. If it's truly a trivial change (e.g. spelling error), you can skip this step -- but as the subject says, when in doubt, open an issue.

  2. Only submit your own work (or work you have sufficient rights to submit) - Please make sure that any code or documentation you submit is your work or you have the rights to submit. We respect the intellectual property rights of others, and as part of contributing, we'll ask you to sign your contribution with a "Developer Certificate of Origin" (DCO) that states you have the rights to submit this work and you understand we'll use your contribution. There's more information about this topic in the DCO section.

Ways to Contribute

Please note: OpenSearch is a fork of Elasticsearch 7.10.2. If you do find references to Elasticsearch (outside of attributions and copyrights!) please open an issue.

Bug Reports

Ugh! Bugs!

A bug is when software behaves in a way that you didn't expect and the developer didn't intend. To help us understand what's going on, we first want to make sure you're working from the latest version. Please make sure you're testing against the latest version.

Once you've confirmed that the bug still exists in the latest version, you'll want to check the bug is not something we already know about. A good way to figure this out is to search for your bug on the open issues GitHub page.

If you've upgraded to the latest version and you can't find it in our open issues list, then you'll need to tell us how to reproduce it. To make the behavior as clear as possible, please provide your steps as curl commands which we can copy and paste into a terminal to run it locally, for example:

# delete the index
curl -X DELETE localhost:9200/test

# insert a document
curl -x PUT localhost:9200/test/test/1 -d '{
 "title": "test document"
}'

# this should return XXXX but instead returns YYYY
curl ....

Provide as much information as you can. You may think that the problem lies with your query, when actually it depends on how your data is indexed. The easier it is for us to recreate your problem, the faster it is likely to be fixed. It is generally always helpful to provide the basic details of your cluster configuration alongside your reproduction steps.

Feature Requests

If you've thought of a way that OpenSearch could be better, we want to hear about it. We track feature requests using GitHub, so please feel free to open an issue which describes the feature you would like to see, why you need it, and how it should work. After opening an issue, the fastest way to see your change made is to open a pull request following the requested changes you detailed in your issue. You can learn more about opening a pull request in the contributing code section.

Documentation Changes

If you would like to contribute to the documentation, please do so in the documentation-website repo. To contribute javadocs, please first check OpenSearch#221.

Contributing Code

As with other types of contributions, the first step is to open an issue on GitHub. Opening an issue before you make changes makes sure that someone else isn't already working on that particular problem. It also lets us all work together to find the right approach before you spend a bunch of time on a PR. So again, when in doubt, open an issue.

Additionally, here are a few guidelines to help you decide whether a particular feature should be included in OpenSearch.

Is your feature important to most users of OpenSearch?

If you believe that a feature is going to fulfill a need for most users of OpenSearch, then it belongs in OpenSearch. However, we don't want every feature built into the core server. If the feature requires additional permissions or brings in extra dependencies it should instead be included as a module in core.

Is your feature a common dependency across multiple plugins?

Does this feature contain functionality that cuts across multiple plugins? If so, this most likely belongs in OpenSearch as a core module or plugin.

Once you've opened an issue, check out our Developer Guide for instructions on how to get started.

Developer Certificate of Origin

OpenSearch is an open source product released under the Apache 2.0 license (see either the Apache site or the LICENSE.txt file). The Apache 2.0 license allows you to freely use, modify, distribute, and sell your own products that include Apache 2.0 licensed software.

We respect intellectual property rights of others and we want to make sure all incoming contributions are correctly attributed and licensed. A Developer Certificate of Origin (DCO) is a lightweight mechanism to do that.

The DCO is a declaration attached to every contribution made by every developer. In the commit message of the contribution, the developer simply adds a Signed-off-by statement and thereby agrees to the DCO, which you can find below or at DeveloperCertificate.org.

Developer's Certificate of Origin 1.1

By making a contribution to this project, I certify that:

(a) The contribution was created in whole or in part by me and I
    have the right to submit it under the open source license
    indicated in the file; or

(b) The contribution is based upon previous work that, to the
    best of my knowledge, is covered under an appropriate open
    source license and I have the right under that license to
    submit that work with modifications, whether created in whole
    or in part by me, under the same open source license (unless
    I am permitted to submit under a different license), as
    Indicated in the file; or

(c) The contribution was provided directly to me by some other
    person who certified (a), (b) or (c) and I have not modified
    it.

(d) I understand and agree that this project and the contribution
    are public and that a record of the contribution (including
    all personal information I submit with it, including my
    sign-off) is maintained indefinitely and may be redistributed
    consistent with this project or the open source license(s)
    involved.

We require that every contribution to OpenSearch is signed with a Developer Certificate of Origin. Additionally, please use your real name. We do not accept anonymous contributors nor those utilizing pseudonyms.

Each commit must include a DCO which looks like this

Signed-off-by: Jane Smith <[email protected]>

You may type this line on your own when writing your commit messages. However, if your user.name and user.email are set in your git configs, you can use -s or --signoff to add the Signed-off-by line to the end of the commit message.

Changelog

OpenSearch maintains version specific changelog by enforcing a change to the ongoing CHANGELOG file adhering to the Keep A Changelog format. The purpose of the changelog is for the contributors and maintainers to incrementally build the release notes throughout the development process to avoid a painful and error-prone process of attempting to compile the release notes at release time. On each release the "unreleased" entries of the changelog are moved to the appropriate release notes document in the ./release-notes folder. Also, incrementally building the changelog provides a concise, human-readable list of significant features that have been added to the unreleased version under development.

Which changes require a CHANGELOG entry?

Changelogs are intended for operators/administrators, developers integrating with libraries and APIs, and end-users interacting with OpenSearch Dashboards and/or the REST API (collectively referred to as "user"). In short, any change that a user of OpenSearch might want to be aware of should be included in the changelog. The changelog is not intended to replace the git commit log that developers of OpenSearch itself rely upon. The following are some examples of changes that should be in the changelog:

  • A newly added feature
  • A fix for a user-facing bug
  • Dependency updates
  • Fixes for security issues

The following are some examples where a changelog entry is not necessary:

  • Adding, modifying, or fixing tests
  • An incremental PR for a larger feature (such features should include one changelog entry for the feature)
  • Documentation changes or code refactoring
  • Build-related changes

Any PR that does not include a changelog entry will result in a failure of the validation workflow in GitHub. If the contributor and maintainers agree that no changelog entry is required, then the skip-changelog label can be applied to the PR which will result in the workflow passing.

How to add my changes to CHANGELOG?

Adding in the change is two step process:

  1. Add your changes to the corresponding section within the CHANGELOG file with dummy pull request information, publish the PR
  2. Update the entry for your change in CHANGELOG.md and make sure that you reference the pull request there.

Where should I put my CHANGELOG entry?

Please review the branching strategy document. The changelog on the main branch will contain two files: CHANGELOG.md which corresponds to unreleased changes intended for the next minor release and CHANGELOG-3.0.md which correspond to unreleased changes intended for the next major release. Your entry should go into file corresponding to the version it is intended to be released in. In practice, most changes to main will be backported to the next minor release so most entries will be in the CHANGELOG.md file.

The following examples assume the next major release on main is 3.0, then next minor release is 2.5, and the current release is 2.4.

  • Add a new feature to release in next minor: Add a changelog entry to [Unreleased 2.x] in CHANGELOG.md on main, then backport to 2.x (including the changelog entry).
  • Introduce a breaking API change to release in next major: Add a changelog entry to [Unreleased 3.0] to CHANGELOG-3.0.md on main, do not backport.
  • Upgrade a dependency to fix a CVE: Add a changelog entry to [Unreleased 2.x] on main, then backport to 2.x (including the changelog entry), then backport to 2.4 and ensure the changelog entry is added to [Unreleased 2.4.1].

Review Process

We deeply appreciate everyone who takes the time to make a contribution. We will review all contributions as quickly as possible. As a reminder, opening an issue discussing your change before you make it is the best way to smooth the PR process. This will prevent a rejection because someone else is already working on the problem, or because the solution is incompatible with the architectural direction.

During the PR process, expect that there will be some back-and-forth. Please try to respond to comments in a timely fashion, and if you don't wish to continue with the PR, let us know. If a PR takes too many iterations for its complexity or size, we may reject it. Additionally, if you stop responding we may close the PR as abandoned. In either case, if you feel this was done in error, please add a comment on the PR.

If we accept the PR, a maintainer will merge your change and usually take care of backporting it to appropriate branches ourselves.

If we reject the PR, we will close the pull request with a comment explaining why. This decision isn't always final: if you feel we have misunderstood your intended change or otherwise think that we should reconsider then please continue the conversation with a comment on the PR and we'll do our best to address any further points you raise.

Tips for Success (#tips)

We have a lot of mechanisms to help expedite towards an accepted PR. Here are some tips for success:

  1. Minimize BWC guarantees: The first PR review cycle heavily focuses on the public facing APIs. This is what we have to "guarantee" as non-breaking for bwc across major versions.
  2. Do not copy non-compliant code: Ensure that code is APLv2 compatible. This means that you have not copied any code from other sources unless that code is also APLv2 compatible.
  3. Utilize feature flags: Features that are safeguarded behind feature flags are more likely to be merged and backported, as they come with an additional layer of protection. Refer to this example PR for implementation details.
  4. Use appropriate Java tags:
    • @opensearch.internal: Marks internal classes subject to rapid changes.
    • @opensearch.api: Marks public-facing API classes with backward compatibility guarantees.
    • @opensearch.experimental: Indicates rapidly changing experimental code.
  5. Employ sandbox for significant core changes: Any new features or enhancements that make changes to core classes (e.g., search phases, codecs, or specialized lucene APIs) are more likely to. be merged if they are sandboxed. This can only be enabled on the java CLI (-Dsandbox.enabled=true).
  6. Micro-benchmark critical path: This is a lesser known mechanism, but if you have critical path changes you're afraid will impact performance (the changes touch the garbage collector, heap, direct memory, or CPU) then including a microbenchmark with your PR (and jfr or flamegraph results in the description) is a GREAT IDEA and will help expedite the review process.
  7. Test rigorously: Ensure thorough testing (OpenSearchTestCase for unit tests, OpenSearchIntegTestCase for integration & cluster tests, OpenSearchRestTestCase for testing REST endpoint interfaces, and yaml tests with ClientYamlTestSuiteIT for REST integration tests)

In general, adding more guardrails to your changes increases the likelihood of swift PR acceptance. We can always relax these guard rails in smaller followup PRs. Reverting a GA feature is much more difficult. Check out the DEVELOPER_GUIDE for more useful tips.

Troubleshooting Failing Builds

The OpenSearch testing framework offers many capabilities but exhibits significant complexity (it does lot of randomization internally to cover as many edge cases and variations as possible). Unfortunately, this posses a challenge by making it harder to discover important issues/bugs in straightforward way and may lead to so called flaky tests - the tests which flip randomly from success to failure without any code changes.

If your pull request reports a failing test(s) on one of the checks, please:

  • look if there is an existing issue reported for the test in question
  • if not, please make sure this is not caused by your changes, run the failing test(s) locally for some time
  • if you are sure the failure is not related, please open a new bug with flaky-test label
  • add a comment referencing the issue(s) or bug report(s) to your pull request explaining the failing build(s)
  • as a bonus point, try to contribute by fixing the flaky test(s)