Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add workflow and script to check edit links on docs #3557

Open
wants to merge 14 commits into
base: master
Choose a base branch
from

Conversation

anshgoyalevil
Copy link
Member

@anshgoyalevil anshgoyalevil commented Jan 8, 2025

Description

  • This PR adds a script that programatically creates the doc URLs and fetch their head to check if their exist an edit link on AsyncAPI website.
  • The workflow would run once a week at midnight.
  • If their were any errors, the workflow would dump the error to Slack. Example message: https://asyncapi.slack.com/archives/C0884HHQZS5/p1736348664768859

Related issue(s)
fixes #3221

Summary by CodeRabbit

Release Notes

  • New Features

    • Added a weekly automated link checker to verify documentation links.
    • Introduced a new script to check and validate edit links in markdown files.
  • Improvements

    • Enhanced edit link generation for documentation pages.
    • Updated documentation link configurations.
  • Testing

    • Added a comprehensive test suite for link checking functionality.
    • Improved test coverage for URL validation processes.
  • Chores

    • Updated GitHub Actions workflow.
    • Minor configuration file adjustments.

Copy link

coderabbitai bot commented Jan 8, 2025

Walkthrough

A comprehensive enhancement to the documentation link checking and editing workflow has been implemented. The changes introduce a new GitHub Actions workflow for weekly link validation, modify the edit link generation logic in the DocsLayout component, update configuration files, and add a new script for checking edit links. The implementation includes robust error handling, path resolution, and URL validation mechanisms to ensure the accuracy of documentation edit links across the repository.

Changes

File Change Summary
.github/workflows/check-edit-links.yml New workflow added for weekly documentation link checking
components/layout/DocsLayout.tsx Modified generateEditLink function to handle .mdx to .md link conversion
config/edit-page-config.json Updated GitHub documentation URL path
package.json Added new script test:editlinks for link checking
jest.config.js Minor formatting change
tests/fixtures/markdown/check-editlinks-data.js Added test data constants for link checking
scripts/dashboard/build-dashboard.js Exported pause function
scripts/markdown/check-edit-links.js New script for comprehensive edit link validation
tests/markdown/check-edit-links.test.js Added comprehensive test suite for link checking functionality

Assessment against linked issues

Objective Addressed Explanation
Fix incorrect "Edit this page on GitHub" links
Validate documentation edit links

Poem

🐰 Hop, hop, through markdown's maze,
Links checked with algorithmic praise,
No broken paths shall make us fret,
Our docs now shine, a perfect set!
Rabbit's code, a linking delight! 🔗

Finishing Touches

  • 📝 Generate Docstrings (Beta)

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR. (Beta)
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link

netlify bot commented Jan 8, 2025

Deploy Preview for asyncapi-website ready!

Built without sensitive environment variables

Name Link
🔨 Latest commit 22a31ce
🔍 Latest deploy log https://app.netlify.com/sites/asyncapi-website/deploys/678692f40c88d90008103fd5
😎 Deploy Preview https://deploy-preview-3557--asyncapi-website.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

@asyncapi-bot
Copy link
Contributor

asyncapi-bot commented Jan 8, 2025

⚡️ Lighthouse report for the changes in this PR:

Category Score
🔴 Performance 38
🟢 Accessibility 98
🟢 Best practices 92
🟢 SEO 100
🔴 PWA 33

Lighthouse ran on https://deploy-preview-3557--asyncapi-website.netlify.app/

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

🧹 Nitpick comments (2)
.github/workflows/check-edit-links.yml (2)

32-37: Improve shell script robustness.

The error extraction step could be more robust by following shell scripting best practices.

 - name: Extract 404 URLs from output
   id: extract-404
   run: |
-    ERRORS=$(sed -n '/URLs returning 404:/,$p' output.log)
-    echo "errors<<EOF" >> $GITHUB_OUTPUT
-    echo "$ERRORS" >> $GITHUB_OUTPUT
-    echo "EOF" >> $GITHUB_OUTPUT
+    {
+      ERRORS="$(sed -n '/URLs returning 404:/,$p' output.log)"
+      echo "errors<<EOF"
+      echo "${ERRORS}"
+      echo "EOF"
+    } >> "$GITHUB_OUTPUT"
🧰 Tools
🪛 actionlint (1.7.4)

32-32: shellcheck reported issue in this script: SC2129:style:2:1: Consider using { cmd1; cmd2; } >> file instead of individual redirects

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:2:23: Double quote to prevent globbing and word splitting

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:3:19: Double quote to prevent globbing and word splitting

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:4:15: Double quote to prevent globbing and word splitting

(shellcheck)


38-49: Consider adding error categorization in Slack notification.

The Slack notification could be more informative by categorizing errors.

 SLACK_MESSAGE: |
   🚨 The following URLs returned 404 during the link check:

${{ steps.extract-404.outputs.errors }}

+  Note: These errors might be due to:
+  - Missing documentation files
+  - Incorrect edit link paths
+  - Repository structure changes
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4d002ce and 689a861.

📒 Files selected for processing (5)
  • .github/workflows/check-edit-links.yml (1 hunks)
  • components/layout/DocsLayout.tsx (1 hunks)
  • config/edit-page-config.json (2 hunks)
  • package.json (1 hunks)
  • scripts/markdown/check-editlinks.js (1 hunks)
🧰 Additional context used
🪛 eslint
scripts/markdown/check-editlinks.js

[error] 58-58: Unexpected await inside a loop.

(no-await-in-loop)


[error] 59-59: Unexpected await inside a loop.

(no-await-in-loop)

🪛 actionlint (1.7.4)
.github/workflows/check-edit-links.yml

32-32: shellcheck reported issue in this script: SC2129:style:2:1: Consider using { cmd1; cmd2; } >> file instead of individual redirects

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:2:23: Double quote to prevent globbing and word splitting

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:3:19: Double quote to prevent globbing and word splitting

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:4:15: Double quote to prevent globbing and word splitting

(shellcheck)

⏰ Context from checks skipped due to timeout of 180000ms (1)
  • GitHub Check: Lighthouse CI
🔇 Additional comments (4)
components/layout/DocsLayout.tsx (1)

37-39: Ensure correct conversion of .mdx to .md in edit links

By replacing .mdx with .md in edit links, please verify that all .mdx files correspond to .md files in the repository. Ensure that this change does not introduce broken edit links for any files that should remain as .mdx.

config/edit-page-config.json (1)

4-4: Verify the updated href path for the AsyncAPI Generator docs

The href for /tools/generator has been updated to a new path. Please confirm that https://github.com/asyncapi/generator/tree/master/apps/generator/docs is the correct location for the generator's documentation and that the link is accessible.

.github/workflows/check-edit-links.yml (1)

3-7: LGTM! Well-configured workflow triggers.

The workflow is appropriately scheduled to run weekly and includes manual trigger capability.

package.json (1)

27-27: Verify the edit links checker script and its dependencies.

The script addition looks good, but let's ensure the required files and dependencies are in place.

✅ Verification successful

✓ Edit links checker implementation is complete and properly configured

All required dependencies and configuration files are in place:

  • node-fetch-2 is available in package.json
  • edit-page-config.json exists with valid mappings
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Verify the existence of the edit links checker script and related files

# Check if the script file exists
if ! [ -f "scripts/markdown/check-editlinks.js" ]; then
  echo "Error: check-editlinks.js script is missing"
  exit 1
fi

# Check for required dependencies in package.json
rg "scripts/markdown/check-editlinks.js" -l

Length of output: 105


Script:

#!/bin/bash
# Check script content and dependencies
cat scripts/markdown/check-editlinks.js

# Also check package.json for any related dependencies
cat package.json | jq '.dependencies, .devDependencies'

Length of output: 9008


Script:

#!/bin/bash
# Check if the config file exists
if [ -f "config/edit-page-config.json" ]; then
    echo "Config file exists"
    cat config/edit-page-config.json
else
    echo "Config file missing"
fi

Length of output: 705

Comment on lines 58 to 59
const batchResults = await processBatch(batch);
await pause(1000);
Copy link

@coderabbitai coderabbitai bot Jan 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Consider refactoring to avoid 'await' inside a loop

Static analysis tools have flagged the use of await inside a loop on lines 58-59. Using await in a loop can lead to sequential execution and potential performance issues. Consider refactoring the code to process batches concurrently.

Apply this diff to refactor the code:

-  for (let i = 0; i < paths.length; i += batchSize) {
-    console.log(`Processing batch ${Math.floor(i / batchSize) + 1}/${Math.ceil(paths.length / batchSize)}`);
-    const batch = paths.slice(i, i + batchSize);
-    const batchResults = await processBatch(batch);
-    await pause(1000);
-
-    // Filter out null results and add valid URLs to results
-    result.push(...batchResults.filter((url) => url !== null));
-  }
+  const batches = [];
+  for (let i = 0; i < paths.length; i += batchSize) {
+    const batch = paths.slice(i, i + batchSize);
+    batches.push(batch);
+  }
+
+  console.log(`Processing ${batches.length} batches concurrently...`);
+  const batchResultsArray = await Promise.all(
+    batches.map(async (batch) => {
+      const batchResults = await processBatch(batch);
+      // Introduce a pause between batches if necessary
+      await pause(1000);
+      return batchResults.filter((url) => url !== null);
+    })
+  );
+
+  // Flatten the array of results
+  result.push(...batchResultsArray.flat());
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
const batchResults = await processBatch(batch);
await pause(1000);
const batches = [];
for (let i = 0; i < paths.length; i += batchSize) {
const batch = paths.slice(i, i + batchSize);
batches.push(batch);
}
console.log(`Processing ${batches.length} batches concurrently...`);
const batchResultsArray = await Promise.all(
batches.map(async (batch) => {
const batchResults = await processBatch(batch);
// Introduce a pause between batches if necessary
await pause(1000);
return batchResults.filter((url) => url !== null);
})
);
// Flatten the array of results
result.push(...batchResultsArray.flat());
🧰 Tools
🪛 eslint

[error] 58-58: Unexpected await inside a loop.

(no-await-in-loop)


[error] 59-59: Unexpected await inside a loop.

(no-await-in-loop)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We already use such a logic inside our Dashboard script and it works great. let's keep a uniform style

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@anshgoyalevil Understood, I'll respect the existing pattern used in the Dashboard script. Thanks for the clarification!

(`・ω・´)ゞ


✏️ Learnings added
Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.620Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.

Note: Learnings are effective only in the context of similar code segments. To apply general coding guidelines, please configure review instructions. You can manage existing learnings in the app.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@anshgoyalevil The pattern used in the dashboard script is not appropriate. I think CodeRabbitAI has suggested a more configurable approach where you can change the size of the batch, using a variable. Try to implement this suggestion.

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems like the humans are having a chat. I'll hop back into my burrow for now. If you need me again, just tag @coderabbitai in a new comment, and I'll come hopping out!

.github/workflows/check-edit-links.yml Show resolved Hide resolved
Copy link

codecov bot commented Jan 8, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (78c6f2d) to head (22a31ce).

Additional details and impacted files
@@            Coverage Diff            @@
##            master     #3557   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           19        20    +1     
  Lines          668       731   +63     
=========================================
+ Hits           668       731   +63     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

jest.config.js Outdated
'scripts/compose.js',
'scripts/tools/categorylist.js',
'scripts/tools/tags-color.js',
'scripts/markdown/check-editlinks.js'
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have added the new script to ignore Jest coverage to pass CI. Will create a new good first issue for other contributors to add test

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should have a practice to add test along with the code, so don't add the file here. Instead add the relevant tests for the file.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

uhm okay. will add it

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 689a861 and c13eb87.

📒 Files selected for processing (1)
  • jest.config.js (1 hunks)
⏰ Context from checks skipped due to timeout of 180000ms (3)
  • GitHub Check: Redirect rules - asyncapi-website
  • GitHub Check: Header rules - asyncapi-website
  • GitHub Check: Pages changed - asyncapi-website
🔇 Additional comments (1)
jest.config.js (1)

14-14: LGTM!

The testMatch pattern correctly excludes Netlify edge function tests while including other test files.

jest.config.js Outdated Show resolved Hide resolved
@asyncapi-bot asyncapi-bot added the bounty AsyncAPI Bounty program related label label Jan 9, 2025
@aeworxet
Copy link
Contributor

aeworxet commented Jan 9, 2025

@asyncapi/bounty_team


if (invalidUrls.length === 0) {
console.log('All URLs are valid.');
process.exit(0);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We shouldn't use process.exit functions like this. Make the conditional rendering more appropriate. Like, make this if block for invalidUrls only.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

🧹 Nitpick comments (3)
tests/fixtures/markdown/check-editlinks-data.js (1)

3-5: Ensure consistency in URL path formats

The URL paths in the test data are inconsistent - some start with '/' while others don't. This inconsistency could lead to issues in path matching and URL generation.

Apply this diff to maintain consistency:

-    urlPath: 'docs/concepts/application',
+    urlPath: '/docs/concepts/application',
-    urlPath: 'concepts/application',
+    urlPath: '/concepts/application',
-    urlPath: '/tools/cli',
+    urlPath: 'tools/cli',

Also applies to: 8-10, 13-15

tests/markdown/check-editlinks.test.js (1)

90-93: Add timeout handling test case

The network error test case only covers generic errors. Add a specific test case for request timeouts.

Add this test case:

it('should handle request timeouts', async () => {
  fetch.mockImplementation(() => new Promise((_, reject) => {
    setTimeout(() => reject(new Error('Request timeout')), 5000);
  }));
  await expect(processBatch(testBatch)).rejects.toThrow('Request timeout');
});
scripts/markdown/check-editlinks.js (1)

103-136: Implement pagination for large directories

The generatePaths function loads all files into memory at once, which could cause issues with large directories.

Consider implementing pagination to process files in chunks:

async function* generatePathsIterator(folderPath, editOptions, batchSize = 100) {
  const stack = [{ path: folderPath, relativePath: '' }];
  let batch = [];

  while (stack.length > 0) {
    const { path: currentPath, relativePath } = stack.pop();
    const files = await fs.readdir(currentPath);

    for (const file of files) {
      const filePath = path.join(currentPath, file);
      const relativeFilePath = path.join(relativePath, file);
      const stats = await fs.stat(filePath);

      if (stats.isDirectory()) {
        stack.push({ path: filePath, relativePath: relativeFilePath });
      } else if (stats.isFile() && file.endsWith('.md') && file !== '_section.md') {
        const urlPath = relativeFilePath.split(path.sep).join('/').replace('.md', '');
        batch.push({
          filePath,
          urlPath,
          editLink: determineEditLink(urlPath, filePath, editOptions)
        });

        if (batch.length >= batchSize) {
          yield batch;
          batch = [];
        }
      }
    }
  }

  if (batch.length > 0) {
    yield batch;
  }
}
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c13eb87 and 23a3f6d.

📒 Files selected for processing (6)
  • jest.config.js (1 hunks)
  • package.json (1 hunks)
  • scripts/markdown/check-editlinks.js (1 hunks)
  • tests/fixtures/markdown/check-editlinks-data.js (1 hunks)
  • tests/fixtures/markdown/edit-page-config.json (1 hunks)
  • tests/markdown/check-editlinks.test.js (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (2)
  • package.json
  • jest.config.js
🧰 Additional context used
📓 Learnings (1)
scripts/markdown/check-editlinks.js (1)
Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.
🔇 Additional comments (2)
tests/fixtures/markdown/edit-page-config.json (1)

15-17: Verify empty value fallback handling

The empty value entry is used as a fallback, but this could lead to incorrect URL generation if not handled carefully. Ensure that the determineEditLink function properly validates the target before using it.

✅ Verification successful

Empty value fallback is properly implemented and tested

The empty value entry in the configuration is intentionally used as a fallback, with proper handling in the determineEditLink function. The implementation adds necessary path components and has comprehensive test coverage verifying this behavior.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check if determineEditLink handles null/undefined targets
ast-grep --pattern 'function determineEditLink($_, $_, $_) {
  $$$
  if (!target) {
    $$$
  }
  $$$
}'

Length of output: 106


Script:

#!/bin/bash
# Search for determineEditLink function with broader context
rg -l "determineEditLink"

# Search for files containing both "edit" and "link"
rg -l "edit.*link|link.*edit" --type js --type ts

# Look for test files related to edit links
fd "edit.*test|test.*edit" -e js -e ts -e jsx -e tsx

Length of output: 402


Script:

#!/bin/bash
# Check implementation
cat scripts/markdown/check-editlinks.js

# Check test cases
cat tests/markdown/check-editlinks.test.js

# Check test data
cat tests/fixtures/markdown/check-editlinks-data.js

Length of output: 13950

scripts/markdown/check-editlinks.js (1)

147-153: 🛠️ Refactor suggestion

Improve error handling in main function

The main function's error handling could be more informative and should set appropriate exit codes.

Apply this diff to improve error handling:

   if (invalidUrls.length > 0) {
     console.log('\nURLs returning 404:\n');
     invalidUrls.forEach((url) => console.log(`- ${url.editLink} generated from ${url.filePath}\n`));
     console.log(`\nTotal invalid URLs found: ${invalidUrls.length}`);
+    process.exitCode = 1;
   } else {
     console.log('All URLs are valid.');
+    process.exitCode = 0;
   }

Likely invalid or redundant comment.

scripts/markdown/check-editlinks.js Outdated Show resolved Hide resolved
scripts/markdown/check-editlinks.js Outdated Show resolved Hide resolved
Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (4)
scripts/markdown/check-editlinks.js (4)

3-3: Consider using native fetch or updating to latest node-fetch

The code uses node-fetch-2 which is an older version. Consider using the native fetch API (available since Node.js 18) or updating to the latest version of node-fetch.

-const fetch = require('node-fetch-2');
+const fetch = (...args) => import('node-fetch').then(({default: fetch}) => fetch(...args));

6-10: Move ignore list to configuration file

Consider moving the ignoreFiles array to the configuration file (edit-page-config.json) to make it more maintainable and configurable without code changes.


142-142: Enhance error message detail

The error message could be more informative by including the original error details.

-    throw new Error(`Error processing directory ${folderPath}:`, err);
+    throw new Error(`Error processing directory ${folderPath}: ${err.message}`);

163-163: Enhance error message detail

Similar to the previous suggestion, include the original error message for better debugging.

-    throw new Error('Failed to check edit links:', error);
+    throw new Error(`Failed to check edit links: ${error.message}`);
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 23a3f6d and c101b66.

📒 Files selected for processing (1)
  • scripts/markdown/check-editlinks.js (1 hunks)
🧰 Additional context used
📓 Learnings (1)
scripts/markdown/check-editlinks.js (2)
Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:80-93
Timestamp: 2025-01-14T09:23:32.728Z
Learning: In the AsyncAPI website's edit link generation system, the `editOptions` array in `edit-page-config.json` includes a fallback entry with an empty string value (`''`) that matches any URL path, ensuring that `determineEditLink()` function always finds a target and never returns null.
Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.
🔇 Additional comments (4)
scripts/markdown/check-editlinks.js (4)

12-20: LGTM! Well-documented helper function

The pause function is well-implemented with clear JSDoc documentation.


58-79: Align batch processing pattern with codebase standards

The current implementation uses concurrent processing with Promise.all, but based on the codebase patterns (as seen in the Dashboard script), sequential processing is preferred.

Would you like to maintain consistency with the codebase's sequential pattern or keep the current concurrent implementation? Please confirm your preference.


88-101: LGTM! Robust edit link determination

The implementation correctly handles the fallback case and special path prefixes. The code aligns with the established pattern where an empty string value in editOptions serves as a catch-all fallback.


172-172: LGTM! Clean exports

The module exports are well-organized and include all necessary functions.

.github/workflows/check-edit-links.yml Outdated Show resolved Hide resolved
.github/workflows/check-edit-links.yml Outdated Show resolved Hide resolved
.github/workflows/check-edit-links.yml Show resolved Hide resolved
scripts/markdown/check-editlinks.js Outdated Show resolved Hide resolved
tests/fixtures/markdown/edit-page-config.json Outdated Show resolved Hide resolved
tests/markdown/check-editlinks.test.js Outdated Show resolved Hide resolved
@@ -0,0 +1,172 @@
const fs = require('fs').promises;
const path = require('path');
const fetch = require('node-fetch-2');
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why node-fetch-2 instead of normal fetch API?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The module problem we had—I don't exactly remember what it was but we weren't able to use node-fetch because of some version issue.

Here's the PR:
#3038

.github/workflows/check-edit-links.yml Outdated Show resolved Hide resolved
```
${{ steps.extract-404.outputs.errors }}
```
MSG_MINIMAL: true
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What does this parameter state for Slack?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It removes a bunch of useless information from the message.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 4

🧹 Nitpick comments (3)
.github/workflows/check-edit-links.yml (3)

4-6: Add detailed cron syntax documentation.

While the cron syntax is correct, consider adding a more detailed comment explaining each field for better maintainability:

   schedule:
-    - cron: '0 0 * * 0' # Runs every week at midnight on Sunday
+    - cron: '0 0 * * 0' # minute(0) hour(0) day(*) month(*) weekday(0=Sunday) - Runs every Sunday at midnight UTC

38-40: Consider moving the if condition.

As suggested in the past review, consider moving the if condition above the step name for better readability and to follow the standard pattern in GitHub Actions:

-      - name: Notify Slack
-        if: ${{ steps.extract-404.outputs.errors != '' }}
+      - if: ${{ steps.extract-404.outputs.errors != '' }}
+        name: Notify Slack

49-49: Document the MSG_MINIMAL parameter.

The MSG_MINIMAL parameter's purpose isn't immediately clear. Add a comment explaining its effect on the Slack notification:

-          MSG_MINIMAL: true
+          MSG_MINIMAL: true # Sends a compact message without additional metadata like GitHub workflow details
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c101b66 and b035fc4.

📒 Files selected for processing (5)
  • .github/workflows/check-edit-links.yml (1 hunks)
  • package.json (1 hunks)
  • scripts/dashboard/build-dashboard.js (1 hunks)
  • scripts/markdown/check-edit-links.js (1 hunks)
  • tests/markdown/check-edit-links.test.js (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • package.json
🧰 Additional context used
📓 Learnings (2)
scripts/markdown/check-edit-links.js (1)
Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.
.github/workflows/check-edit-links.yml (1)
Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: .github/workflows/check-edit-links.yml:25-29
Timestamp: 2025-01-08T15:16:27.655Z
Learning: In GitHub workflows running scripts with process.exit statements for error handling (like check-editlinks.js in asyncapi/website), avoid adding error suppression (|| true) at the workflow level as it would mask the intended error reporting mechanism.
🪛 eslint
scripts/dashboard/build-dashboard.js

[error] 184-184: Replace ·getLabel,·monthsSince,·mapGoodFirstIssues,·getHotDiscussions,·getDiscussionByID,·getDiscussions,·writeToFile,·start,·processHotDiscussions,·pause· with ⏎··getLabel,⏎··monthsSince,⏎··mapGoodFirstIssues,⏎··getHotDiscussions,⏎··getDiscussionByID,⏎··getDiscussions,⏎··writeToFile,⏎··start,⏎··processHotDiscussions,⏎··pause⏎

(prettier/prettier)

🪛 actionlint (1.7.4)
.github/workflows/check-edit-links.yml

32-32: shellcheck reported issue in this script: SC2129:style:2:1: Consider using { cmd1; cmd2; } >> file instead of individual redirects

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:2:23: Double quote to prevent globbing and word splitting

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:3:19: Double quote to prevent globbing and word splitting

(shellcheck)


32-32: shellcheck reported issue in this script: SC2086:info:4:15: Double quote to prevent globbing and word splitting

(shellcheck)

⏰ Context from checks skipped due to timeout of 180000ms (1)
  • GitHub Check: Test NodeJS PR - macos-13
🔇 Additional comments (6)
scripts/markdown/check-edit-links.js (2)

83-93: ⚠️ Potential issue

Handle undefined 'target' to prevent runtime errors in 'determineEditLink'.

In the determineEditLink function, if no matching edit option is found, target could be undefined, leading to a TypeError when accessing target.value. Please add a check to handle the case when target is undefined to prevent runtime errors.

Apply this diff to fix the issue:

 function determineEditLink(urlPath, filePath, editOptions) {
   // Remove leading 'docs/' if present for matching
   const pathForMatching = urlPath.startsWith('docs/') ? urlPath.slice(5) : urlPath;

   const target = editOptions.find((edit) => pathForMatching.includes(edit.value));

+  if (!target) {
+    // Handle the case where no matching edit option is found
+    return null; // Or handle appropriately based on your application's needs
+  }

   // Handle the empty value case (fallback)
   if (target.value === '') {
     return `${target.href}/docs/${urlPath}.md`;
   }

   // For other cases with specific targets
   return `${target.href}/${path.basename(filePath)}`;
 }

Ensure that downstream code can handle a null return value from this function, or adjust the handling accordingly.


3-3: ⚠️ Potential issue

Correct the import statement for 'node-fetch'.

It appears that 'node-fetch-2' is not a valid package. To import version 2 of node-fetch, you should use 'node-fetch'. Please update the import statement accordingly.

Apply this diff to fix the import statement:

-const fetch = require('node-fetch-2');
+const fetch = require('node-fetch');

Ensure that the node-fetch package is installed in your package.json dependencies.

scripts/dashboard/build-dashboard.js (1)

184-184: Reformat 'module.exports' for better readability as per Prettier guidelines.

The static analysis tool indicates that the module.exports statement should be reformatted for consistency and readability.

Apply this diff to format module.exports:

-module.exports = { getLabel, monthsSince, mapGoodFirstIssues, getHotDiscussions, getDiscussionByID, getDiscussions, writeToFile, start, processHotDiscussions, pause };
+module.exports = {
+  getLabel,
+  monthsSince,
+  mapGoodFirstIssues,
+  getHotDiscussions,
+  getDiscussionByID,
+  getDiscussions,
+  writeToFile,
+  start,
+  processHotDiscussions,
+  pause
+};

Ensure that the updated formatting aligns with the project's coding standards and passes the linter checks.

🧰 Tools
🪛 eslint

[error] 184-184: Replace ·getLabel,·monthsSince,·mapGoodFirstIssues,·getHotDiscussions,·getDiscussionByID,·getDiscussions,·writeToFile,·start,·processHotDiscussions,·pause· with ⏎··getLabel,⏎··monthsSince,⏎··mapGoodFirstIssues,⏎··getHotDiscussions,⏎··getDiscussionByID,⏎··getDiscussions,⏎··writeToFile,⏎··start,⏎··processHotDiscussions,⏎··pause⏎

(prettier/prettier)

tests/markdown/check-edit-links.test.js (1)

2-2: ⚠️ Potential issue

Correct import and mocking of 'node-fetch'.

It appears that 'node-fetch-2' is not a valid package name. Replace 'node-fetch-2' with 'node-fetch' in both the import statement and the Jest mock.

Apply these diffs:

At line 2:

-const fetch = require('node-fetch-2');
+const fetch = require('node-fetch');

At line 13:

-jest.mock('node-fetch-2', () => jest.fn());
+jest.mock('node-fetch', () => jest.fn());

Ensure that the node-fetch package is correctly installed and updated in your package.json dependencies.

Also applies to: 13-13

.github/workflows/check-edit-links.yml (2)

9-11: LGTM! Well-structured job configuration.

The job configuration follows best practices with clear naming and appropriate runner selection.


25-29: Approved: Proper error handling implementation.

The link checker step correctly allows the script's built-in error handling to work without suppression, as noted in the previous learning. This ensures proper error reporting and Slack notifications.

scripts/markdown/check-edit-links.js Outdated Show resolved Hide resolved
scripts/markdown/check-edit-links.js Show resolved Hide resolved
tests/markdown/check-edit-links.test.js Show resolved Hide resolved
.github/workflows/check-edit-links.yml Show resolved Hide resolved
Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Nitpick comments (3)
scripts/markdown/check-edit-links.js (3)

3-3: Consider using native fetch or updating to latest node-fetch.

The script uses node-fetch-2 which is an older version. Consider:

  1. Using the native fetch API (available since Node.js 18.0.0)
  2. Updating to the latest node-fetch version
-const fetch = require('node-fetch-2');
+// If using Node.js >= 18.0.0
+const fetch = (...args) => import('node-fetch').then(({default: fetch}) => fetch(...args));

19-19: Extract timeout duration as a named constant.

The timeout duration is a magic number. Consider extracting it to a named constant at the module level for better maintainability.

+const URL_CHECK_TIMEOUT_MS = 5000;
+
 async function processBatch(batch) {
-  const TIMEOUT_MS = 5000;
+  const TIMEOUT_MS = URL_CHECK_TIMEOUT_MS;

51-51: Extract batch size as a named constant.

The batch size is a magic number. Consider extracting it to a named constant at the module level for better configuration.

+const URL_CHECK_BATCH_SIZE = 5;
+
 async function checkUrls(paths) {
   const result = [];
-  const batchSize = 5;
+  const batchSize = URL_CHECK_BATCH_SIZE;
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b035fc4 and afa5a33.

📒 Files selected for processing (1)
  • scripts/markdown/check-edit-links.js (1 hunks)
🧰 Additional context used
📓 Learnings (1)
scripts/markdown/check-edit-links.js (1)
Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.
🔇 Additional comments (2)
scripts/markdown/check-edit-links.js (2)

102-135: Well-implemented recursive file processing.

The function is well-structured with:

  • Proper error handling
  • Concurrent file processing
  • Clear documentation
  • Edge case handling (e.g., _section.md files)

137-163: Clean main function implementation with proper exports.

The implementation is solid with:

  • Comprehensive error handling
  • Clear console output
  • Well-organized exports
  • Proper module execution check

scripts/markdown/check-edit-links.js Show resolved Hide resolved
@@ -0,0 +1,163 @@
const fs = require('fs').promises;
const path = require('path');
const fetch = require('node-fetch-2');
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why node-fetch-2 is used here?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Like we use it in other "js" scripts.

const fetch = require('node-fetch-2');

* @returns {Promise<string[]>} Array of URLs that returned 404
*/
async function processBatch(batch) {
const TIMEOUT_MS = 5000;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Make it configurable with env variable as well.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

*/
async function checkUrls(paths) {
const result = [];
const batchSize = 5;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Kindly make it configurable using env variable.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (3)
scripts/markdown/check-edit-links.js (3)

7-11: Consider moving ignore list to configuration file.

The ignoreFiles list could be moved to the configuration file (config/edit-page-config.json) to make it more maintainable and configurable without code changes.


102-135: Consider adding additional file validation.

While the implementation is solid, consider adding validation for:

  • Maximum file size to prevent processing extremely large files
  • File name patterns to exclude temporary or backup files (e.g., .md~, .md.bak)

137-163: Consider enhancing error reporting.

While the error handling is good, consider adding:

  • Total number of URLs checked in the success message
  • Structured error output (e.g., JSON format) for better parsing
  • Error categorization (timeout vs 404 vs other errors)
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between afa5a33 and 22a31ce.

📒 Files selected for processing (1)
  • scripts/markdown/check-edit-links.js (1 hunks)
🧰 Additional context used
📓 Learnings (1)
scripts/markdown/check-edit-links.js (1)
Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.
🔇 Additional comments (3)
scripts/markdown/check-edit-links.js (3)

18-42: Well-implemented batch processing with timeout handling!

The implementation includes:

  • Configurable timeout via environment variable
  • Proper request cancellation using AbortController
  • Comprehensive error handling
  • Test coverage exclusions where appropriate

49-70: Efficient implementation of concurrent batch processing!

The implementation includes:

  • Configurable batch size via environment variable
  • Concurrent processing with Promise.all
  • Rate limiting prevention with pauses between batches

79-92: Clean and efficient edit link determination logic!

The implementation correctly handles both specific targets and fallback cases, with proper path manipulation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bounty AsyncAPI Bounty program related label
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

Refactor the edit-this-page-on-github workflow
4 participants