Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/reduce spidering #111

Open
wants to merge 2 commits into
base: master
Choose a base branch
from
Open

Feature/reduce spidering #111

wants to merge 2 commits into from

Conversation

jechols
Copy link
Contributor

@jechols jechols commented Nov 12, 2014

Simplifies the robots.txt rules and adds an index to disallow spidering of /images/tiles. In chronam core, /images/tiles isn't the only URL that generates tiles (see ticket #109 ), but I believe it is the primary URL for OpenSeadragon.

Unfortunately, any sites still using the old URL style (/lccn/snxxxxxxxxx/YYYY-MM-DD/ed-#/seq-#/image_514x514_from_1022,4094_to_2050,5122.jpg) won't benefit from this, but addressing #109, and having users update local themes, would allow this to work for everybody. (Or extra rules could be added to catch this style of URL, too)

It might seem strange to disallow bots from hitting the tiles, but Apache log analysis showed us that we were getting a huge amount of traffic from bots - including thumbnails and the dynamic tiles.

Blocks all bots instead of just named bots, and blocks thumbnails since
they're all dynamically generated and bots can really slow the site down
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant