2.4.0 Release #565

Merged Jun 8, 2020 (90 commits)
c1f0f75
indexsource: add new XmlQueryIndexSource
ikreymer Feb 2, 2018
94eb4ad
loaders: add WebHDFSLoader loader to support handling 'webhdfs://' sc…
ikreymer Feb 2, 2018
ec88e96
indexsource: add tests for XmlQueryIndexSource, add missing init_from…
ikreymer Feb 5, 2018
959481f
loaders: webhdfs loader: support optional '&user.name=<name>' param f…
ikreymer Feb 8, 2018
b38cfb8
apps: frontendapp customizations (to support ukwa/ukwa-pywb#6)
ikreymer Feb 10, 2018
5b7ca18
rewriting: try more granular modifers to distinguish embeds: (in part…
ikreymer Feb 10, 2018
77eefcd
- support for allow/block/exclude access controls (as per ukwa/ukwa-p…
ikreymer Feb 18, 2018
a3f81dc
access system work for ukwa/ukwa-pywb#7
ikreymer Feb 18, 2018
bfa3aa7
wb-manager acl command: support manipulating sorted access-list .aclj…
ikreymer Feb 21, 2018
0c1dfba
aclmanager: add unit tests for 'wb-manager acl' commands (ukwa/ukwa-p…
ikreymer Feb 21, 2018
ef9051a
yaml loader: support env var interpolation in loaded YAML using os.ex…
ikreymer Feb 25, 2018
3020606
simplify exception handling:
ikreymer Feb 25, 2018
0d68f67
routes: make coll route config extendable to support prefix routing f…
ikreymer Feb 27, 2018
5364275
memento prefer header: add support for Prefer header for specifying '…
ikreymer Feb 24, 2018
a301dda
memento prefer header improvements: (ukwa/ukwa-pywb#12)
ikreymer Feb 25, 2018
871cef2
proxy mode and prefer header: (ukwa/ukwa-pywb#16)
ikreymer Feb 28, 2018
f30b280
self-redirect check: run redirect check if status code is blank or do…
ikreymer Mar 2, 2018
43537fe
error messaging: app path not found use default error.html template
ikreymer Mar 2, 2018
af3e9c6
error reporting: ensure NotFoundException used for replay not found e…
ikreymer Mar 3, 2018
465195f
static path prefix fix to support non-root pywb deployment:
ikreymer Jul 25, 2018
8bf2f9d
Added some print statements for debugging.
anjackson Feb 14, 2019
b8124e3
lxml query parsing fix: (addressing part of ukwa/ukwa-pywb#38)
ikreymer Feb 14, 2019
60ad173
Moar prints.
anjackson Feb 14, 2019
0c08b9b
acl optimization: addresses ukwa/ukwa-pywb#38
ikreymer Feb 14, 2019
ce0ed61
memento-fix: fix for ukwa/ukwa-pywb#37.
ikreymer Feb 14, 2019
9b2ae35
acl optimization: fixes ukwa/ukwa-pywb#39
ikreymer Feb 15, 2019
c65f66e
acl optimize/fixes:
ikreymer Feb 15, 2019
5da6122
memento timemap fix: further fix for ukwa/ukwa-pywb#37
ikreymer Feb 15, 2019
3868f5b
fix typo: undo unintended change from warning in earlier commit, stic…
ikreymer Feb 15, 2019
0a9ad5c
timemap format fix: fixes ukwa-pywb/pywb#37
ikreymer Feb 16, 2019
54a4e38
memento 404 fix: ensure timemap only includes memento headers on succ…
ikreymer Feb 18, 2019
c00f30e
Double-quoting XmlQueryIndexSource lookups for #29
anjackson Feb 27, 2019
2a30731
Log query being executed.
anjackson Feb 27, 2019
cb3d119
Use space and let quote_plus encode to plus (and avoid it becoming %2…
anjackson Feb 27, 2019
e92b196
xmlindexsource: fix tests for double escaping of query (for ukwa/ukwa…
ikreymer Mar 7, 2019
42b8c3a
merge: additional fixes after merge of ukwa/pywb and 2.2
ikreymer Mar 8, 2019
1b0c9c6
misc fixes from merge:
ikreymer Mar 8, 2019
3589240
ui template overhaul to simplify customization:
ikreymer Mar 9, 2019
7ac9a37
acl: support for exact acl rules via '###' suffix
ikreymer Mar 8, 2019
e04adea
transclusions/augmentations: add new video/audio translcusions script
ikreymer Mar 9, 2019
ce10d9a
docstrings: add docstrings, remove duplicate call, cleanup ACLManager…
ikreymer Apr 3, 2019
1a7fdd0
documented and cleaned up the aclmanager.py
N0taN3rd Apr 3, 2019
41c3712
documented and cleaned up the aclmanager.py2
N0taN3rd Apr 3, 2019
9a40d29
added lxml requirments entry to extra_requirments.txt and documented …
N0taN3rd Apr 3, 2019
8d98b91
added additional code documentation in order to meet the documentatio…
N0taN3rd Apr 10, 2019
61b6ff2
added missing comma to setup.py's tests_require list
N0taN3rd Sep 4, 2019
e34606c
static files:
N0taN3rd Sep 4, 2019
ae78a95
templates
N0taN3rd Sep 4, 2019
69f7f02
static files:
N0taN3rd Sep 4, 2019
5ab97a4
templates:
N0taN3rd Sep 4, 2019
d6ab31d
templates:
N0taN3rd Sep 5, 2019
379f7de
manual
N0taN3rd Sep 5, 2019
802b9fa
apps:
N0taN3rd Sep 10, 2019
a3294c8
fix exception handling:
ikreymer Sep 11, 2019
2f6fb74
bump version to 2.4.0
ikreymer Sep 11, 2019
dc30c89
enable new transclusion system for tests (not enabled by default)
ikreymer Sep 11, 2019
9ce3242
Merge pull request #453 from webrecorder/ukwa-merge
ikreymer Oct 8, 2019
59b735e
tests: fix all tests for updated to webenact, use https when possible…
ikreymer Oct 26, 2019
35004c1
Fixed calendar view dropping query parameters by using encodeURICompo…
N0taN3rd Oct 26, 2019
6f79840
Docs, custom metadata improvements (#509)
ikreymer Oct 27, 2019
fed3263
Docs: Fix access controls and ui customizations docs links (#513)
ikreymer Oct 31, 2019
8baa8cb
docs: fix doc typo in BaseWarcServer example (#507)
yvmarques Nov 1, 2019
02cc703
query: fix query for IE11, don't use ES6 syntax, add URL polyfill (#514)
ikreymer Nov 1, 2019
44dcd39
UI: tweak query page to be responsive (#515)
m4rk3r Nov 1, 2019
fe09d99
rewrite fix: don't inject checkThis function into every script, now h…
ikreymer Nov 7, 2019
66ac3ca
config limit: add query_limit config options to specify optional limi…
ikreymer Nov 7, 2019
0d819aa
Localization and Banner Update (#517)
ikreymer Nov 11, 2019
c7fdfe7
Restrict POST query size (#519)
ikreymer Nov 12, 2019
3068080
proxy mode: replay improvements for content not captured via proxy mo…
ikreymer Nov 12, 2019
0be8452
index query limit: ensure 'limit' is correctly applied to XmlQueryInd…
ikreymer Nov 22, 2019
523e35d
fuzzy matching: apply fuzzy match if url prefix and regex match, even…
nlevitt Dec 21, 2019
f0b9d5b
Rewriting fix for DASH FB and document.write (#529)
ikreymer Jan 11, 2020
fb8aa7c
revisit lookup fix (possible fix for ukwa/ukwa-pywb#53) (#530)
ikreymer Jan 11, 2020
93ce4f6
Banner fix (#531)
ikreymer Jan 11, 2020
fa021ee
Misc Fixes for RC5 (#534)
ikreymer Jan 18, 2020
92e459b
R6 - Various Fixes (#540)
ikreymer Feb 21, 2020
6b014d0
try to remove headers with illegal characters. arquivo/pwa-technologi…
danielbicho Apr 30, 2020
8d8cf7e
Fix documentation: replace fl to fields on doc webrecorder/pywb#542 (…
igobranco Apr 30, 2020
8f0ce45
docs: fix proxy default timestamp yaml example (#544)
thomas536 Apr 30, 2020
be90e06
MANIFEST.in: Create (#559)
jayvdb Apr 30, 2020
871a05a
proxy mode: respect settings when started from cli (#557)
micronn May 1, 2020
7e56ca8
RC7 Fixes (#561)
ikreymer May 1, 2020
ed89fcc
rules: update yt rules
ikreymer Jun 2, 2020
5e9b13e
proxy mode: don't rewrite xml for ajax requests. Support python 3.8 (…
ikreymer Jun 8, 2020
3c53c27
memento timegate: make timegate headers for /<coll>/<url> behave corr…
ikreymer Jun 8, 2020
8a6475a
is-ajax check: only check Sec-Fetch-Mode in proxy mode, only treat 'c…
ikreymer Jun 8, 2020
d7d83b0
new transclusions: use urn:embeds:<url> for embeds resource lookup in…
ikreymer Jun 8, 2020
af76ce9
appveyor: fix appveyor builds, add py38
ikreymer Jun 8, 2020
47e87ef
CHANGES: bump version and update changelist for 2.4.0
ikreymer Jun 8, 2020
c7373ba
update to latest wombat for 2.4.0 release
ikreymer Jun 8, 2020
2 changes: 0 additions & 2 deletions .dockerignore
@@ -1,8 +1,6 @@
build/
dist/
karma-tests/
sample_archive/
tests/
tests_disabled/
venv/
collections/
2 changes: 2 additions & 0 deletions .travis.yml
@@ -5,6 +5,7 @@ python:
- "3.5"
- "3.6"
- "3.7"
- "3.8"

dist: xenial

@@ -39,6 +40,7 @@ after_success:
matrix:
allow_failures:
- env: WR_TEST=yes
- python: "2.7"

exclude:
- env: WR_TEST=yes
2 changes: 1 addition & 1 deletion .travis/install.sh
@@ -2,11 +2,11 @@
set -e

pip install --upgrade pip setuptools
pip install 'Markupsafe<2.0.0'
python setup.py -q install
pip install -r extra_requirements.txt
pip install coverage pytest-cov coveralls
pip install codecov
npm install

if [ "$WR_TEST" = "yes" ]; then
git clone https://github.com/webrecorder/webrecorder-tests.git
36 changes: 36 additions & 0 deletions CHANGES.rst
@@ -1,3 +1,39 @@
pywb 2.4.0 changelist
~~~~~~~~~~~~~~~~~~~~~

This release includes a significant update, specifically the merge of the https://github.com/ukwa/pywb branch.
A few selected improvements:

* New Access Control System: https://pywb.readthedocs.io/en/latest/manual/access-control.html

* Support for Localization, configuring multiple languages (not enabled by default): https://github.com/ukwa/ukwa-pywb/blob/master/docs/localization.md

* Support for OpenWayback-style XML-based index source (xmlquery)

* Support for loading from WebHDFS via `webhdfs://` scheme.

* Initial support for a new embeds/transclusions replay system, in combination with warcit: https://github.com/webrecorder/warcit/wiki/Warcit-Video-Audio-Conversion

* Proxy mode improvements: handle OPTIONS requests and CORS `#520 <https://github.com/webrecorder/pywb/pull/520>`_

* Memento Prefer header: support for experimental `Prefer` header to select 'raw' or 'rewritten' memento

* Other memento fixes: fix timemap including invalid mementos, correct timegate behavior on top frame `#564 <https://github.com/webrecorder/pywb/pull/564>`_

* Fixes for collection metadata display: `#509 <https://github.com/webrecorder/pywb/pull/509>`_

* Fix for incorrect WARC record length due to re-serialized headers: `#561 <https://github.com/webrecorder/pywb/pull/561>`_

* Filter invalid WARC records `#536 <https://github.com/webrecorder/pywb/pull/536>`_

* Updated fuzzy matching rules and wombat client-side rewriting.


For the full changelist, see this PR: `#565 <https://github.com/webrecorder/pywb/pull/565>`_

* Access Control System
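
The Memento ``Prefer`` header item in the changelist above can be illustrated with a small client-side sketch. It only builds a request object and does not contact a server. The host, port, and collection name are hypothetical, and sending the bare values ``raw`` or ``rewritten`` is an assumption based on the wording of the changelist entry rather than a confirmed header syntax.

```python
from urllib.request import Request

# Preference names taken from the changelist wording; the exact header
# syntax accepted by the server is an assumption.
ALLOWED_PREFERENCES = {"raw", "rewritten"}


def build_prefer_request(url, preference):
    """Build (but do not send) a GET request carrying a Prefer header."""
    if preference not in ALLOWED_PREFERENCES:
        raise ValueError("unsupported preference: %r" % preference)
    return Request(url, headers={"Prefer": preference})


# Hypothetical local deployment and collection name.
req = build_prefer_request(
    "http://localhost:8080/my-coll/http://example.com/", "raw"
)
print(req.get_header("Prefer"))
```

Passing the resulting object to ``urllib.request.urlopen`` would send it; whether the response is the raw or the rewritten memento depends on the server configuration.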


pywb 2.3.5 changelist
~~~~~~~~~~~~~~~~~~~~~

26 changes: 26 additions & 0 deletions MANIFEST.in
@@ -0,0 +1,26 @@
include LICENSE
include *.rst
include *requirements.txt
include *.yaml
include *.yml
include .gitmodules
include .dockerignore
include Dockerfile
include Vagrantfile
include uwsgi.ini
include run-tests.py
include *.sh
recursive-include static *.js
recursive-include pywb *.ini
recursive-include pywb *.md
recursive-include pywb *.py
recursive-include pywb *.yaml
recursive-include sample_archive *.aclj
recursive-include tests *.po
recursive-include tests *.yaml
recursive-include tests_disabled *.py
recursive-include tests_disabled *.yaml
recursive-include docs *.bat
recursive-include docs *.py
recursive-include docs *.rst
recursive-include docs Makefile
14 changes: 6 additions & 8 deletions README.rst
@@ -1,4 +1,4 @@
Webrecorder pywb 2.3
Webrecorder pywb 2.4
====================

.. image:: https://travis-ci.org/webrecorder/pywb.svg?branch=master
@@ -39,7 +39,11 @@ The 2.x release included a major overhaul of pywb and introduces many new featur

* Standalone, modular `client-side rewriting system (wombat.js) <https://github.com/webrecorder/wombat>`_ to handle most modern web sites.

* Improved 'calendar' query UI, grouping results by year and month, and updated replay banner.
* Improved 'calendar' query UI with incremental loading, grouping results by year and month, and updated replay banner.

* New in 2.4: Extensible UI customizations system for modifying all aspects of the UI.

* New in 2.4: Robust access control system for blocking or excluding URLs, by prefix or by exact match.
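
The difference between prefix and exact-match access rules can be sketched as follows. This is an illustrative model only, not the pywb implementation: the ``match_rule`` function, the in-memory rule list, and the "most specific rule wins" policy are assumptions, while the ``###`` suffix for exact rules and the ``allow``/``block`` values are taken from the commit messages in this PR.

```python
EXACT_SUFFIX = "###"  # marker for exact rules, per the commit message


def match_rule(url_key, rules):
    """Return the access value of the most specific matching rule, or None.

    `rules` is a list of (rule_key, access) pairs. A rule_key ending in
    '###' matches only an identical url_key; any other rule_key matches
    every url_key that starts with it (hypothetical matching policy).
    """
    best_access = None
    best_rank = (-1, False)
    for rule_key, access in rules:
        is_exact = rule_key.endswith(EXACT_SUFFIX)
        base = rule_key[:-len(EXACT_SUFFIX)] if is_exact else rule_key
        matched = (url_key == base) if is_exact else url_key.startswith(base)
        rank = (len(base), is_exact)  # longer base wins, exact breaks ties
        if matched and rank > best_rank:
            best_access, best_rank = access, rank
    return best_access


# Hypothetical rules keyed by SURT-style URL keys.
rules = [
    ("com,example)/", "allow"),
    ("com,example)/private", "block"),
    ("com,example)/private/readme###", "allow"),
]

print(match_rule("com,example)/private/data", rules))
print(match_rule("com,example)/private/readme", rules))
```

Here the exact rule applies to ``com,example)/private/readme`` alone, so a key such as ``com,example)/private/readme.txt`` still falls under the broader ``block`` prefix rule.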


Please see the `full documentation <https://pywb.readthedocs.org>`_ for more detailed info on all these features.
@@ -67,12 +71,6 @@ Contributions & Bug Reports

Users are encouraged to fork and contribute to this project to keep improving web archiving tools.

A few key features are high on list of priorities, but have not yet been implemented, including:

* Url Exclusion System

* UI Improvements

If you are interested in contributing, especially to any of these areas, please let us know!

Otherwise, please take a look at `list of current issues <https://github.com/webrecorder/pywb/issues>`_ and feel free to open new ones about any aspect of pywb, including the new documentation.
7 changes: 5 additions & 2 deletions appveyor.yml
@@ -3,26 +3,29 @@ environment:
CMD_IN_ENV: "cmd /E:ON /V:ON /C obvci_appveyor_python_build_env.cmd"

matrix:
- PYTHON: "C:\\Python27"
- PYTHON: "C:\\Python27-x64"
- PYTHON: "C:\\Python35"
- PYTHON: "C:\\Python35-x64"
- PYTHON: "C:\\Python36"
- PYTHON: "C:\\Python36-x64"
- PYTHON: "C:\\Python37"
- PYTHON: "C:\\Python37-x64"
- PYTHON: "C:\\Python38"
- PYTHON: "C:\\Python38-x64"



install:
- "SET PATH=%PYTHON%;%PYTHON%\\Scripts;%PATH%"
- "python -m pip install --upgrade pip"
- "pip install -U setuptools"
- "pip install MarkupSafe==1.1.1"
- "pip install coverage pytest-cov"
- "pip install cffi"
- "pip install pyopenssl"
- "pip install pypiwin32"
- "pip install certauth boto3 youtube-dl pysocks"
- "pip install codecov"
- "pip install wheel"

build_script:
- "python setup.py install"
8 changes: 8 additions & 0 deletions docs/code/pywb.manager.rst
@@ -4,6 +4,14 @@ pywb\.manager package
Submodules
----------

pywb\.manager\.aclmanager module
--------------------------------

.. automodule:: pywb.manager.aclmanager
:members:
:undoc-members:
:show-inheritance:

pywb\.manager\.autoindex module
-------------------------------

8 changes: 8 additions & 0 deletions docs/code/pywb.rewrite.rst
@@ -100,6 +100,14 @@ pywb\.rewrite\.rewrite\_hls module
:undoc-members:
:show-inheritance:

pywb\.rewrite\.rewrite\_js\_workers module
------------------------------------------

.. automodule:: pywb.rewrite.rewrite_js_workers
:members:
:undoc-members:
:show-inheritance:

pywb\.rewrite\.rewriteinputreq module
-------------------------------------

12 changes: 12 additions & 0 deletions docs/code/pywb.rst
@@ -14,6 +14,18 @@ Subpackages
pywb.utils
pywb.warcserver

Submodules
----------

pywb\.version module
--------------------

.. automodule:: pywb.version
:members:
:undoc-members:
:show-inheritance:


Module contents
---------------

8 changes: 8 additions & 0 deletions docs/code/pywb.utils.rst
@@ -60,6 +60,14 @@ pywb\.utils\.memento module
:undoc-members:
:show-inheritance:

pywb\.utils\.merge module
-------------------------

.. automodule:: pywb.utils.merge
:members:
:undoc-members:
:show-inheritance:

pywb\.utils\.wbexception module
-------------------------------

16 changes: 16 additions & 0 deletions docs/code/pywb.warcserver.rst
@@ -12,6 +12,22 @@ Subpackages
Submodules
----------

pywb\.warcserver\.access\_checker module
----------------------------------------

.. automodule:: pywb.warcserver.access_checker
:members:
:undoc-members:
:show-inheritance:

pywb\.warcserver\.amf module
----------------------------

.. automodule:: pywb.warcserver.amf
:members:
:undoc-members:
:show-inheritance:

pywb\.warcserver\.basewarcserver module
---------------------------------------

2 changes: 2 additions & 0 deletions docs/index.rst
@@ -16,6 +16,8 @@ A subset of features provides the basic functionality of a "Wayback Machine".

manual/usage
manual/configuring
manual/access-control
manual/ui-customization
manual/architecture
manual/apis
code/pywb