- Using
curl
instead ofwget
- Fix #36 (unable to read cookie file)
- Fix #34 (
413 Request Entity Too Large
)
- Loop detection: #24.
- Add test cases.
- Update documentation (Cookie issue.)
- Minor code improvements.
- Group with category support (#28, Thanks @LeeKevin)
- Fix bugs: #6 (compatibility issue), #13 (so large group), #16 (email exporting and third-party license issue)
- Fix script shebang.
- Google organization support.
- Ensure group name is in lowercase.
- Minor scripting improvements.
- Drop the use of
lynx
program.wget
handles all download now. - Accept
_WGET_OPTIONS
environment to controlwget
commands. - Can work with private groups thanks to
_WGET_OPTIONS
environment. - Rename script (
craw.sh
becomescrawler.sh
.) - Output important variables to the output script.
- Update documentation (
README.md
.)
- Provide fancy agent to
wget
andlynx
command. - Fix wrong URL of
rss
feed. - Use
set -u
to avoid unbound variable. - Fix display charset of
lynx
program. See #3.
- The first public version.