Jump to content

Legal talk:Notices received from search engines

From Wikimedia Foundation Governance Wiki

This page hasn't been updated in three years

Why have the Wikimedia Foundation stopped updating this page, has their position changed on content removal notices? The latest Wikimedia Transparency Report from late 2021 (there were none in 2022) included the passage linking here "We have a dedicated page where we post notices of delisted project pages that we have received from the search engines who provide such information as part of their own commitments to transparency." However, the latest information displayed on the page is from the second half of 2019, not 2021 let alone 2022. Has Wikimedia received no requests in the last three years or is it not willing to publish them anymore, and why were there no Transparency Reports last year? --Voello (talk) 02:22, 5 January 2023 (UTC)Reply

Postings flowing again, but with missing URLs

The posting of notices to this page appears to have resumed after quite a long silence (see above), but something has gone awry with the latest uploads.

Reports for English Wikipedia through File:En72RTBF.pdf (18 May 2024) included the "Here are the affected URL(s)" section of the notice, making the affected page visible. (Although as a screenshot PDF, it's still annoyingly not possible to simply copy-paste the text, like it would be on the original notice page.)

However, starting with File:En73RTBF.pdf and as far as I can tell including every capture since, the affected URL(s) section is not visible in the captured notice, making it uselessly generic. It's of no help to anyone to simply see that some notice was sent, without the affected pages being documented as well.

I see the same problem with postings for other wikis. (See, for example, File:Fr54RTBF.pdf and File:De65RTBF.pdf.)

The files (good and bad) were all uploaded by KFrancis (WMF), so my apologies for singling you out with a ping on this. FeRDNYC (talk) 14:23, 6 July 2024 (UTC)Reply

Also, if I might make two suggestions:
  1. While I understand the need to archive these files in a static form like a PDF, there are a number of good tools for doing so without converting the page to an image, so that embedded text remains readable and copy/paste-friendly, and all of the content is captured (not just whatever's visible on the screen at the moment).
    • For Google Chrome, there are several extensions in the Chrome Store that provide page-to-PDF functionality. A number of those also exist for Firefox. I couldn't possibly speculate on Safari, Edge, or any other terrible browsers.
    • But an extension might not even be needed. Most operating systems provide (or can have installed) a print driver that saves to PDF. Using that, a page can be archived via the browser's "Print..." functionality. (Again in Chrome, it's necessary to bypass the Chrome built-in print tools by selecting "Print using system dialog" to bring up the native print interface, but printing even secure pages to PDF using that method is typically very effective.)
  2. Issues of notice archiving and included data aside, rather than sequentially numbering the notices ("En##RTBF.pdf", "De##RTBF.pdf"), it might be more helpful to save them with the date of the notice in the filename, instead.

    For File:En84RTBF.pdf (the latest English notice), an alternate name might be EnRTBF-20240626.pdf, for example. Saved that way, the filenames would provide more visibility into both when notices are received, and just reading through the list(s) of files would give an at-a-glance impression of how frequently a given wiki's pages are subjected to search-engine removal.

    In the unlikely event that more than one notice is received on the same day for a given wiki (though it doesn't seem as though they're coming fast enough for that to be a concern), a _1, _2, etc. could be appended to the date portion of the filename.

Regarding #1, I'd be happy to provide whatever help I can in working out a solution for archiving these notices in a more complete and accessible format, while still ensuring that it meets all of Legal's criteria for permanence and immutability. FeRDNYC (talk) 14:41, 6 July 2024 (UTC)Reply
Hello! Thank you for your message. For a bit of background, these notices were consistently posted until there was a glitch where legal stopped receiving the notices around mid-2019 (hence the large backlog). The glitch has since been fixed and I've been working on getting the notices missed posted. -The new notices, from this calendar year (2024) and the end of 2023 have strangely been missing the URL of the takedown notices (the only information included was the language of the project and some boilerplate language explaining why the takedown URL was not listed). We don't have any visibility into Google's internal policies but it seems reasonable to assume they have an internal policy where some takedown URLs can't be listed. I understand this is frustrating as the notices posted recently are not all that helpful. - There are a few notices received this week, however, that DO list the takedown URL. Those will be posted in order and the backlog is log.
If there is a better/more helpful way to post the takedown notices, I would be very happy to work with you to come up with a more useful solution. KFrancis (WMF) (talk) 00:12, 13 July 2024 (UTC)Reply
@KFrancis (WMF) Thank you for the updates. I've found this article about the decision of Google https://www.seroundtable.com/google-right-to-be-forgotten-takedown-notices-missing-36905.html Pyb en résidence (talk) 09:50, 29 August 2024 (UTC)Reply
@Pyb en résidence Ooh, good find! That does appear to explain the change... though the reasoning behind said change is confounding in a way that only courts of law can really aspire to.
I should apologize, @KFrancis (WMF), for my assumptions in my original message; when I saw that the recent PDFs didn't include URLs, but did have a small blue rectangle at the bottom of the page that looked like the top of the box where the URL appeared in the older messages, I assumed the newer PDFs were simply incomplete (cut off before the URL), and compounded my error by then further assuming it was a process issue on our end. Neither assumption was correct, so my apologies twice over.
Now understanding the situation, in light of that court ruling barring Google from including the URLs in question, the only thing I can't figure out now is why they're bothering to continue (or resume?) sending notices — without the URL, those notices are useless. They contain absolutely no information worth being notified about.
They probably aren't worth archiving, either, since every notice is now identical — "we removed some unspecified pages at your site from the search results for some unspecified search terms". That information is... well, it's completely useless to us, really. (Not to mention, non-actionable.)
Since this is a Legal project page I'm not going to fiddle with the content (IANAL), but it might be worth updating Legal:Notices received from search engines with...
  1. A link, in the opening "For context" section, to either the Guardian article (https://www.theguardian.com/technology/2024/feb/15/google-stops-notifying-publishers-of-right-to-be-forgotten-removals-from-search-results) that @Pyb en résidence's SEO blog article sourced its info from, or a link to the very dry, but very detailed International Association of Privacy Professionals post (https://iapp.org/news/a/swedish-court-rejects-googles-appeal-in-rtbf-case/) that the Guardian quoted and seems to have used as their primary source.
  2. Some changes to the rest of the intro text, since it discusses "the subject of the affected webpage" and editors making changes to "address the referred content", neither of which are applicable if we don't know the URLs removed. The closing paragraph also ends with, "Search engines are not required by law to provide notices, so we appreciate the companies who share our commitment to free speech and transparency. Compelled censorship is unacceptable, but compelled censorship without notice is unforgivable." which feels like it's implicitly discussing notices that include URLs, since with the URLs removed — regardless of the reasons, and whether or not Google had any choice in the matter — it's difficult to characterize the notices as transparent.
FeRDNYC (talk) 04:23, 4 September 2024 (UTC)Reply

What is the URL to file a DMCA counter-notice with Google?

@KFrancis (WMF): Most of these notices seem to include a link in the original email for filing a DMCA counter-notice. However, the links in the PDF versions doesn't work (as it seems to just be a screenshot of the email). Thus no one can actually file a counter-notice. Can you provide the URL that the links go to so that we can actually file counter-notices as appropriate? Or else create the PDFs from the actual email HTML, not images of the emails. Thank you. Nosferattus (talk) 03:31, 25 March 2025 (UTC)Reply

After much searching, I did find https://reportcontent.google.com/landing/counter_notice?product=websearch&product_ctx&url&content_id&hl=en. However this form requires entering a "reference ID" for the original takedown, which doesn't seem to exist in the notices you've posted. Perhaps the links in your notices go to a different form that doesn't require a reference ID, or perhaps the ID is encoded into the links. Nosferattus (talk) 03:44, 25 March 2025 (UTC)Reply
Hello @[[User:Nosferattus|Nosferattus], thank you for your inquiry. Unfortunately, the URLs in question have not been proved to us on the notices we are receiving. We don't have any visibility into Google's internal policies but it seems reasonable to assume they have an internal policy where some takedown URLs can't be listed. I understand this is frustrating as notices without a URL are not helpful. I am working on cleaning up the public page and will be adding some additional language to explain as to why the URLs are not being posted. Please note, we do, on occasion, receive one off notices that DO list the URL. All of those will be posted as received. For record keeping purposes, running tallies of all notices received (URL or not) will be included in the number charts on the public page. I've recently been added to an archive for missing time frames where notices were not being sent to legal between 2000 - 2003/4, and the backlog of those tallied notices will be added as soon as possible. I hope this information is helpful. KFrancis (WMF) (talk) 00:07, 27 March 2025 (UTC)Reply
@KFrancis (WMF): That is not what I was asking. I don't want the takedown URLs of the media, I want the URL for filing a DMCA counter-notice at google.com. In other words, if you click the "DMCA Counter Notification" link in the original notices from Google, where does that take you? You can't click it in the PDF versions that you've supplied here since they are just images. Without that URL we can't file counter-notices (regardless of whether we have the takedown URLs or not). Nosferattus (talk) 23:52, 3 April 2025 (UTC)Reply
Thank you for the clarification. The notices we are receiving, and depending on the language, look like this:
To: Owner of https://en.m.wikipedia.org/,
Due to a request under data protection law in Europe, Google can no longer show one or more pages from your site in Google Search results. This only affects responses to some search queries for names or other personal identifiers that might appear on your pages. Only results on versions of Google’s search results for countries applying European data protection law are affected. No action is required from you.
...
Read about Google’s process for European data protection law removals.
The only clickable link refers to Google's process for European data protection law removals. The old link where one could "notify Google" is no longer on the notices we receive.
I hope this is helpful and please let me know if you have more questions. KFrancis (WMF) (talk) 21:14, 4 April 2025 (UTC)Reply
@KFrancis (WMF): No, I'm asking about DMCA take-down notices, not European right-to-be-forgotten notices. For example File:En99RTBF.pdf, which has blocked Google from linking to the Wikipedia article for Mother's Day. I can't file a counter-notice with Google unless I know where that "DMCA Counter Notice" link in File:En99RTBF.pdf (in the 4th paragraph) actually goes. (And WMF can't file the counter-notice due to the DMCA safe harbor provisions.) Nosferattus (talk) 01:34, 8 April 2025 (UTC)Reply
@KFrancis (WMF): Any update on this? Nosferattus (talk) 15:18, 11 April 2025 (UTC)Reply
Apologies for the late response as I was out of office for the last couple of weeks. The link you requested is here: https://reportcontent.google.com/landing/counter_notice?product=websearch&utm_source=wnc_594600&utm_medium=gamma&utm_campaign=wnc_594600&utm_content=msg_638303&hl=en&visit_id=638810374117637227-2986320125&rd=1
I am trying to figure out how to better post these types of notices so you may access the links when actually listed. KFrancis (WMF) (talk) 20:43, 23 April 2025 (UTC)Reply
-Following up on my last message. It looks like any links from notices received can be accessed from the "Summary" section of notices posted, so going forward, I will also add that information under the summary section for notices posted with any links to follow. KFrancis (WMF) (talk) 23:41, 23 April 2025 (UTC)Reply
Sadly, that URL won't work for me since it requires a "reference ID", which "is provided in the notification you received when we restricted your content", although no such reference ID seems to be in the notification. Apparently though, you can use https://reportcontent.google.com/forms/counter_notice?web-redirect=f&product=websearch to file a counter-notice without a reference ID. Nosferattus (talk) 20:57, 3 May 2025 (UTC)Reply