Moodle resource downloader

About

Web scrapper built with Puppeteer and Typescript, with its focus being downloading moodle resources. Additionally it organises them for each module and section/week.

Why?

I made this because I needed to download a lot of resources and automating the process seemed a fun way to learn about web scraping.

DISCLOSURE

This scrapper was developed based on the interface of a education institution that I was part of. So it might not work on yours.

Pre-Requisites

Before installing the scrapper, you will need the following:

You will need to have Node.js installed. A version of grater or equal to v12.x. You can download here: https://nodejs.org/en/

Installation

Install the dependencies: npm install
Build the project: npx tsc

Configuration

There are some run configurations you need to set up. Either via command line and/or configuration file (/scrapper-config.json).

There is no scrapper-config.json because private information might be present there, so create one based on the file: template-scrapper-config.json

Configuration Templates:

Configuration templates are present in the folder: scrapper-configs.

For example if you are using Greenwich's moodle, just create a scrapper-config with the contents of scrapper-configs/scrapper-config-greenwich.json

Templates available:

Greenwich: scrapper-configs/scrapper-config-greenwich.json

Run configuration

You can run the scrapper by simply using npm run start or node ./dist/main.js. Remember to pay attention to the command line for inputs.

Configuration Parameters

If there there is a option that is present in both command line and configuration file, the CLI flag and JSON parameter will be seperated by a |. The required parameters are authorizeUrl, waitPageAfterLogin, modulesListPage.

--authorize-url | authorizeUrl The usual host moodle link you use to enter it. Include the protocol(e.g http, https, https://mymoodle.com)
--wait-page-after-login | waitPageAfterLogin - What page should the scrapper wait after authenticating. For example, it can be the dashboard or the home page.
--modules-list-page | modulesListPage - What page contains the list of modules. This scrapper uses the Dashboard as basis to obtain the modules list.
--username <moodle-login-username> | username The username you use to login. (UNOPERATIONAL AT THE MOMENT)
--download-path <path> | downloadPath The path where the resources will be downloaded(By default it's the ./downloads folder located in the root of the project).
--headless | headless When this parameter is set to true, Puppeteer will be executed with the option with the headless mode activated. Headless mode allows the scrapper to run without displaying the UI. Default value is false. Optional. (DO NOT USE. BUGS AT THE MOMENT)
--auth-method | authMethod - It can have one of the following values: "user-control", "terminal-user-passw". Default user-control. (terminal-user-passw IS NOT OPERATIONAL.)
- user-control allows you to insert your username and password in the browser's page like an usual login procedure. It's useful if you don't want to input your credentials in the terminal, to see the scrapper steps, and if for some reason the authentication requires more input than just password and username. Only disadvantage is that the UI will need to be displayed, which requires the headless mode to be disabled.
- terminal-user-passw The username and password will be prompted by the terminal. Doing this way allows the scrapper/puppeteer to run in headless mode.
- If you want to use terminal-user-passw, it can also be run with the shortcut: npm run start:auth-terminal

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
downloads		downloads
scrapper-configs		scrapper-configs
src		src
.gitignore		.gitignore
LICENCE		LICENCE
README.md		README.md
downloads_folder_screenshot.png		downloads_folder_screenshot.png
package-lock.json		package-lock.json
package.json		package.json
template-scrapper-config.json		template-scrapper-config.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Moodle resource downloader

About

Why?

DISCLOSURE

Pre-Requisites

Installation

Configuration

Configuration Templates:

Run configuration

Configuration Parameters

About

Releases 2

Packages

Contributors 2

Languages

License

bigfarofa/moodle-resource-downloader

Folders and files

Latest commit

History

Repository files navigation

Moodle resource downloader

About

Why?

DISCLOSURE

Pre-Requisites

Installation

Configuration

Configuration Templates:

Run configuration

Configuration Parameters

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Languages

Packages