Skip to content

Latest commit

 

History

History
182 lines (139 loc) · 6.67 KB

README.md

File metadata and controls

182 lines (139 loc) · 6.67 KB

Blurdle

What

A user script to remove Wordle sharing posts from a Facebook feed

Install

  1. Install Tampermonkey in your web browser

  2. From the Tampermonkey Dashboard, click on the Utilities tab. In the "Import from URL" field, paste:

    https://raw.githubusercontent.com/richfromm/blurdle/master/blurdle.js

    Click the "Install" button.

Tested with Chrome and MacOS, but I think it should work with other browsers and OS's.

It's probably possible to export this as a Chrome extension, but that's future work.

I have no clue if it's possible to install extensions or user scripts on mobile.

Why

I actually like Wordle, and play it regularly. I also sometimes play various derivatives like Nerdle and Absurdle. But I find the not insignificant stream of Wordle shares a little annoying. And I explicitly try to not look at them, because I don't want to get any hints if I haven't done the day's puzzle yet.

A friend joked on Facebook that someone should start a social platform just for the Wordle people. I replied, just write a Greasemonkey script to strip the posts from your feed. The response asked if I was volunteering.

I've never actually done anything like this, but it seemed like a good opportunity to learn. Hence this quick hack.

How

First pass

Originally, I took a look at the source of a sample page with a Wordle post in the feed. Each post seemed to be a very heavily nested sequence of <div> tags. I saw:

                    <div data-pagelet="FeedUnit_0">

then:

                    <div data-pagelet="FeedUnit_1">

then literally a bunch like this (that is, I'm not using "n" as a numeric substitution):

                    <div data-pagelet="FeedUnit_{n}">

Further down the <div> element rabbithole, is the start of a Wordle share:

                                     <div class="kvgmc6g5 cxmmr5t8 oygrvhab hcukyx3x c1et5uql">
                                      Wordle 242 3/6
                                     </div>

I used JQuery to find <div> tags with a data-pagelet attribute that starts with FeedUnit_. They are further filtered to only return ones that have some nested <div> tag that contains Wordle. But I didn't want to delete any post mentioning "Wordle" at all, just ones that look like a share, so that's further filtered with having some inner HTML that matches what appears to be the beginning of a Wordle share (the text in the example above) via a regular expression.

The Facebook news feed is infinitely scrolling and populated dynamically. So we can't just filter a fixed document and call it a day. The actual filtering is contained in a function filter(), that's called on the current page contents once a second.

Update

This worked great, for about 3 weeks. As I had anticipated might happen (but maybe a bit sooner than I had hoped), the Facebook output changed for me on 2022-03-08, breaking my script.

The <div> tags that contain each item in the news feed no longer have a data-pagelet attribute that starts with FeedUnit_. In fact, they no longer have any attributes at all.

The next two levels in the hierarchy of <div> insanity (immediately after the level that formerly contained the data-pagelet attribute) have precisely the following attributes:

                     <div class="du4w35lb k4urcfbm l9j0dhe7 sjgh65i0">
                      <div class="du4w35lb l9j0dhe7">

So that's what I'm using now. Well, at least the first line. I search for all <div> tags with precisely that class attribute, then filter for Wordle posts like before, and finally take their parent tags (the <div> tags that now have no attributes) and remove them.

Latest Update

As of this writing (2022-08-19), this is broken again.

At least it lasted a bit longer this time, a bit more than 2 months.

The structure is now similar to the previous update, but the specifics of the classes have changed. Each news feed post now starts with the following:

                     <div class="b6ax4al1">
                      <div>
                       <div class="g4tp4svg mfclru0v om3e55n1 p8bdhjjv">
                        <div class="g4tp4svg om3e55n1">

I could likely get this to work (and may do so as a first pass) just by changing the beginning of my div filter to look for class='b6ax4al1'. But it would be nice if this were a little less fragile.

A better approach may be to take advantage of the fact that the top of the news feed looks something like this:

                    <div role="feed">
                     <h3 class="..." dir="auto">
                      News Feed posts
                     </h3>

Not precisely, as I've left out the class attribute of the <h3> tag, and it has changed over time.

It would be a bit more robust to search for the enclosing <div> tag with the role="feed" attribute, and/or the <h3> tag with the text News Feed posts, then assume that the next subsequent <div> tag lower in the hierarchy with a non-empty class attribute can be used as the signaling key for the beginning of a news feed post.

The News Feed posts text might be more fragile than the role="feed" attribute. Somewhat interestingly, I don't actually see that text header visible anywhere in my browser window (Google Chrome on Mac OS), although the Find function (via Cmd-F) is able to locate a single occurrence of it. I wonder if the lack of visibility makes it more or less likely to change its text.

I hope to get a chance to work on this soon. Right now I need to get back to looking for a job.

Caveats

As I mentioned when I first posted this, like any kind of screen scraping, this is inherently fragile. If either Facebook or Wordle makes changes to their output format, it can easily break, and already has.

The update takes the fragility of this implementation to a whole new level. However, the specific class attribute that I am now selecting upon was present in the exact same form even before the breakage, and it continues to be present unchanged as of this writing, which is 3 months after the breakage. So maybe I'll get lucky and this will last a bit longer this time.

I am a software developer, but not a front end guy. My experience with JavaScript is extremely limited, and this was my first time writing a user script, as well as my first time using JQuery. I suspect that this might not be the most efficient solution, and I'm happy to accept suggestions for improvement.