-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update SeqKit to v2.9.0 #51860
Update SeqKit to v2.9.0 #51860
Conversation
📝 Walkthrough📝 WalkthroughWalkthroughThe pull request involves modifications to the Possibly related PRs
Suggested labels
📜 Recent review detailsConfiguration used: CodeRabbit UI 📒 Files selected for processing (1)
🧰 Additional context used🪛 yamllintrecipes/seqkit/meta.yaml[error] 1-1: syntax error: found character '%' that cannot start any token (syntax) 🔇 Additional comments (3)recipes/seqkit/meta.yaml (3)
The version update to 2.9.0 matches the latest official release from November 1, 2024, which includes important fixes for sequence ID parsing and improvements to various SeqKit commands. 🧰 Tools🪛 yamllint[error] 1-1: syntax error: found character '%' that cannot start any token (syntax)
Resetting the build number to 0 is correct when updating to a new version.
MD5 checksums do not match the official release artifacts. Based on the previous verification results, the MD5 checksums in the recipe do not match the official release artifacts:
Let's verify the current checksums against the official release: #!/bin/bash
# Description: Verify MD5 checksums match the official release artifacts
# Expected: MD5 checksums should match the values in meta.yaml
VERSION="2.9.0"
TEMP_DIR=$(mktemp -d)
cd "$TEMP_DIR" || exit 1
echo "Verifying checksums for SeqKit v${VERSION}..."
# Download and verify each platform artifact
for platform in darwin_arm64 linux_amd64 linux_arm64; do
echo -e "\nPlatform: ${platform}"
curl -sL "https://github.com/shenwei356/seqkit/releases/download/v${VERSION}/seqkit_${platform}.tar.gz" -o "${platform}.tar.gz"
md5sum "${platform}.tar.gz"
done
cd - || exit 1
rm -rf "$TEMP_DIR" Also applies to: 12-12, 15-15, 18-18 Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
CodeRabbit Configuration File (
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 1
🧹 Outside diff range and nitpick comments (1)
recipes/seqkit/meta.yaml (1)
1-24
: Consider testing backward compatibility.SeqKit v2.9.0 includes several important fixes that affect sequence parsing and handling:
- Modified sequence ID parsing behavior
- Changes to
-G/--non-greedy
option inlocate
command- Updates to negative region handling in
grep/subseq
- New
sum_n
column instats
outputThese changes might affect existing pipelines that depend on the specific behavior of these features.
Consider:
- Testing the package with existing bioinformatics pipelines
- Documenting any behavior changes in the package metadata
- Adding example commands in the test section for the modified features
🧰 Tools
🪛 yamllint
[error] 1-1: syntax error: found character '%' that cannot start any token
(syntax)
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (1)
recipes/seqkit/meta.yaml
(1 hunks)
🧰 Additional context used
🪛 yamllint
recipes/seqkit/meta.yaml
[error] 1-1: syntax error: found character '%' that cannot start any token
(syntax)
🔇 Additional comments (2)
recipes/seqkit/meta.yaml (2)
1-1
: LGTM: Version update matches the release.
The version bump to 2.9.0 correctly reflects the latest SeqKit release from November 1, 2024.
🧰 Tools
🪛 yamllint
[error] 1-1: syntax error: found character '%' that cannot start any token
(syntax)
21-21
: LGTM: Build number reset is correct.
Resetting the build number to 0 is appropriate when updating to a new version.
recipes/seqkit/meta.yaml
Outdated
md5: c83d82188fbd83da765b5ae9939f5079 # [osx] | ||
|
||
url: https://github.com/shenwei356/seqkit/releases/download/v{{ version }}/seqkit_darwin_arm64.tar.gz # [arm64] | ||
md5: 0b2716f39c3974a5c3e310bcbd4076eb # [arm64] | ||
md5: c6ddaa62d73280b645215143087dd1db # [arm64] | ||
|
||
url: https://github.com/shenwei356/seqkit/releases/download/v{{ version }}/seqkit_linux_amd64.tar.gz # [linux] | ||
md5: 67220b508f3f81c2c8697e6534eed440 # [linux] | ||
md5: 5b5c5ba2796d45f42b2e61fde4d79821 # [linux] | ||
|
||
url: https://github.com/shenwei356/seqkit/releases/download/v{{ version }}/seqkit_linux_arm64.tar.gz # [aarch64] | ||
md5: b83b021850c9447cfc8e0d5751b7be99 # [aarch64] | ||
md5: 5cbca93de57c11d1f70d1e713a44be5a # [aarch64] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
💡 Codebase verification
MD5 checksums do not match the official release artifacts
The verification shows mismatches between the expected and actual MD5 checksums for all platforms:
- darwin_arm64: expected
c6ddaa62d73280b645215143087dd1db
, gotcf1b7c4b74664f04d3cc361e9c5370a1
- linux_amd64: expected
5b5c5ba2796d45f42b2e61fde4d79821
, got83e359c186dee6b49cd4df362d0b6d5a
- linux_arm64: expected
5cbca93de57c11d1f70d1e713a44be5a
, got7a81adcd79553e10552456060c2b17ba
Please update the MD5 checksums in meta.yaml to match the current release artifacts.
🔗 Analysis chain
Verify checksums against official release artifacts.
Let's verify the MD5 checksums match the official SeqKit v2.9.0 release artifacts.
Let me try a different approach using curl
instead of wget
since it seems to be available in the sandbox environment.
🏁 Scripts executed
The following scripts were executed for the analysis:
Script:
#!/bin/bash
# Description: Verify MD5 checksums match the official release artifacts
# Expected: MD5 checksums should match the values in meta.yaml
# Create a temporary directory for downloads
TEMP_DIR=$(mktemp -d)
cd "$TEMP_DIR" || exit 1
# Download and verify each platform artifact
VERSION="2.9.0"
PLATFORMS=("darwin_amd64" "darwin_arm64" "linux_amd64" "linux_arm64")
for platform in "${PLATFORMS[@]}"; do
echo "Verifying $platform..."
wget -q "https://github.com/shenwei356/seqkit/releases/download/v${VERSION}/seqkit_${platform}.tar.gz"
md5sum "seqkit_${platform}.tar.gz"
done
cd - || exit 1
rm -rf "$TEMP_DIR"
Length of output: 1669
Script:
#!/bin/bash
# Description: Verify MD5 checksums match the official release artifacts using curl
# Expected: MD5 checksums should match the values in meta.yaml
# Create a temporary directory for downloads
TEMP_DIR=$(mktemp -d)
cd "$TEMP_DIR" || exit 1
# Download and verify each platform artifact
VERSION="2.9.0"
echo "Verifying darwin_arm64..."
curl -sL "https://github.com/shenwei356/seqkit/releases/download/v${VERSION}/seqkit_darwin_arm64.tar.gz" -o darwin_arm64.tar.gz
echo "c6ddaa62d73280b645215143087dd1db darwin_arm64.tar.gz (expected)"
md5sum darwin_arm64.tar.gz
echo -e "\nVerifying linux_amd64..."
curl -sL "https://github.com/shenwei356/seqkit/releases/download/v${VERSION}/seqkit_linux_amd64.tar.gz" -o linux_amd64.tar.gz
echo "5b5c5ba2796d45f42b2e61fde4d79821 linux_amd64.tar.gz (expected)"
md5sum linux_amd64.tar.gz
echo -e "\nVerifying linux_arm64..."
curl -sL "https://github.com/shenwei356/seqkit/releases/download/v${VERSION}/seqkit_linux_arm64.tar.gz" -o linux_arm64.tar.gz
echo "5cbca93de57c11d1f70d1e713a44be5a linux_arm64.tar.gz (expected)"
md5sum linux_arm64.tar.gz
cd - || exit 1
rm -rf "$TEMP_DIR"
Length of output: 1353
No useful information in the |
|
9234b8b
to
f2ea067
Compare
my bad, thank you @bgruening :) |
Describe your pull request here
Changes
seqkit
:seqkit locate
:-G/--non-greedy
for tandem repeats, e.g., ATTCGATTCGATTCG (ATTCGx3).seqkit grep/subseq
:seqkit stats
:sum_n
to count the number of ambiguous characters. #490Please read the guidelines for Bioconda recipes before opening a pull request (PR).
General instructions
@BiocondaBot please add label
command.@bioconda/core
in a comment.Instructions for avoiding API, ABI, and CLI breakage issues
Conda is able to record and lock (a.k.a. pin) dependency versions used at build time of other recipes.
This way, one can avoid that expectations of a downstream recipe with regards to API, ABI, or CLI are violated by later changes in the recipe.
If not already present in the meta.yaml, make sure to specify
run_exports
(see here for the rationale and comprehensive explanation).Add a
run_exports
section like this:with
...
being one of:{{ pin_subpackage("myrecipe", max_pin="x") }}
{{ pin_subpackage("myrecipe", max_pin="x.x") }}
{{ pin_subpackage("myrecipe", max_pin="x.x") }}
(in such a case, please add a note that shortly mentions your evidence for that){{ pin_subpackage("myrecipe", max_pin="x.x.x") }}
(in such a case, please add a note that shortly mentions your evidence for that){{ pin_subpackage("myrecipe", max_pin=None) }}
while replacing
"myrecipe"
with eithername
if aname|lower
variable is defined in your recipe or with the lowercase name of the package in quotes.Bot commands for PR management
Please use the following BiocondaBot commands:
Everyone has access to the following BiocondaBot commands, which can be given in a comment:
@BiocondaBot please update
@BiocondaBot please add label
please review & merge
label.@BiocondaBot please fetch artifacts
You can use this to test packages locally.
Note that the
@BiocondaBot please merge
command is now depreciated. Please just squash and merge instead.Also, the bot watches for comments from non-members that include
@bioconda/<team>
and will automatically re-post them to notify the addressed<team>
.