Mastodawn

Show thread

ancuuiqter Feb 8, 2024

Maybe you’re thinking of Sci-Hub and its founder, Alexandra Asanovna Elbakyan?

I could not find a location on Anna’s Archive’s wiki page.

Sci-Hub - Wikipedia

Show thread

ancuuiqter Feb 8, 2024

The official Anna’s Archive Reddit account, AnnaArchivist, has responded to an r/AnnasArchive post linking the same Torrent Freak article:

Thanks! We’re not making any public statements about this lawsuit but rest assured we’re fine.

AnnaArchivist (u/AnnaArchivist) - Redlib

View on Redlib, an alternative private front-end to Reddit.

Show thread

ancuuiqter Feb 8, 2024

Would you be able to share where you learned that Anna’s Archive is based in Kazakhstan?

Show thread

ancuuiqter Feb 8, 2024

Regarding the operating location(s) of Anna’s Archive, OCLC is alleging the following (pages 7-9):

C. Defendants Rely on Sophisticated Technology and Online Practices to Conceal their Identities.

Defendants understand that their pirate library enterprise and related activities, here, hacking and harvesting OCLC’s WorldCat® records, are illegal. Id. ¶¶ 11, 65, PageID 3, 11. Defendants admit that they are engaging in and facilitating mass copyright infringement, stating, “[w]e deliberately violate the copyright law in most countries.” Id. ¶ 67 PageID 12. In another blog post, Defendants noted that their activities could lead to arrest and “decades of prison time.”1 Defendants have also recognized that their hacking and distribution of OCLC’s data is improper, acknowledging that WorldCat® is a “proprietary database,” that OCLC’s “business model requires protecting their database,” and that Defendants are “giving it all away. :-).” Compl. ¶ 90, Dkt. 1 at PageID 16.

Because Defendants understand their actions infringe on copyright laws, amongst others, Defendants go to great lengths to remain anonymous to ensure both that Anna’s Archive’s domains are not taken down and to avoid the legal consequences of their actions, including civil lawsuits where parties like OCLC seek to vindicate their rights, as well as criminal and regulatory enforcement actions undertaken by government entities. None of Anna’s Archive’s domains or its online blog provide a business address, business contact, or other contact information that would be found on a legitimate entity’s website. See Kim Decl.¶ 9, attached here as Exhibit 1.

Defendants have explained in a blog post that they are “being very careful not to leave any trace [of their online activities], and having strong operational security.”2 For instance, Anna’s Archive utilizes a VPN with “[a]ctual court-tested no-log policies with long track records of protecting privacy.”3 Each of the Anna’s Archive domains are registered using foreign hosts, registrars, and registrants in order to conceal the identity of the site operators. Kim Decl. ¶¶ 7–8. Additionally, Defendants rely on multiple proxy servers to maintain anonymity.4 Defendants also use a free version of Cloudflare, a top-level hosting provider, so that they do not have to provide any payment or other identifying information.5 See also Kim Decl. ¶ 6. Defendants selected Cloudflare because they claim Cloudflare has resisted requests to take down websites for copyright infringement.6 The individuals behind Anna’s Archive also use usernames as pseudonyms to mask their identities online. Compl. ¶ 65, Dkt. 1 at PageID 11.

Through the work of a cyber security and digital forensic investigation firm, OCLC was able to identify one of the individuals behind Anna’s Archive by name and locate a United States address, Defendant Maria Dolores Anasztasia Matienzo.7 However, the physical address and contact information of Anna’s Archive and the identities and contact information of the John Does remain unknown. See id. ¶ 6–11. It is highly likely that Anna’s Archive is a non-domestic, foreign entity, based on the findings from OCLC’s investigator, as set forth below. See id.

OCLC explained the above in their Motion To Serve Defendant Anna’s Archive By Email, as justification for why they seek “permission to serve Anna’s Archive by alternative means, here, email, pursuant to Federal Rule of Civil Procedure 4(h)(2) and (f)(3).”

OCLC Online Computer Library Center, Inc. v. Anna's Archive : Free Download, Borrow, and Streaming : Internet Archive

This item represents a case in PACER, the U.S. Government's website for federal case data. If you wish to see the entire case, please consult PACER directly.

Internet Archive

Show thread

ancuuiqter Feb 8, 2024

As to how Anna’s Archive accomplished their data scraping, this is what OCLC is claiming (see page 62-63):

These attacks were accomplished with bots (automated software applications) that “scraped” and harvested data from WorldCat.org and other WorldCat®-based research sites and that called or pinged the server directly. These bots were initially masked to appear as legitimate search engine bots from Bing or Google.

To scrape or harvest the data on WorldCat.org, the bots searched WorldCat.org results, running a script based on OCN for individual JavaScript Object Notation, or “JSON,” records. As a result, WorldCat® data including freely accessible and enriched data, such as OCNs, were scraped from individual results on WorldCat.org.

The bots also harvested data from WorldCat.org by pretending to be an internet browser, directly calling or “pinging” OCLC’s servers, and bypassing the search, or user interface, of WorldCat.org. More robust WorldCat® data was harvested directly from OCLC’s servers, including enriched data not available through the WorldCat.org user interface.

Finally, WorldCat® data was harvested from a member’s website incorporating WorldCat® Discovery Services, a subscription-based variation of WorldCat.org that is available only to a member’s patrons. Again, the hacker pinged OCLC’s servers to harvest WorldCat® records directly from the servers. To do this through WorldCat® Discovery Services/FirstSearch, the hacker obtained and used the member’s credentials to authenticate the requests to the server as a member library.

From WorldCat® Discovery Services, hackers harvested 2 million richer WorldCat® records that included data not available in WorldCat.org. This hacking method resulted in the harvesting of some of OCLC’s most proprietary fields of WorldCat® data.

These hacking attacks materially affected OCLC’s production systems and servers, requiring around-the-clock efforts from November 2022 to March 2023 to attempt to limit service outages and maintain the production systems’ performance for customers. To respond to these ongoing attacks, OCLC spent over 1.4 million dollars on its systems’ infrastructure and devoted nearly 10,000 employee hours to the same.

Despite OCLC’s best efforts, OCLC’s customers experienced many significant disruptions in paid services during the aforementioned period as a result of the attacks on WorldCat.org, requiring OCLC to create system workarounds to ensure services functioned.

During this time, customers threatened and likely did cancel their products and services with OCLC due to these disruptions.

Because OCLC had to combat these persistent hacking attacks, OCLC was forced to divert existing personnel and resources from OCLC’s other products and services. As a result, OCLC’s development and improvements to other products and services were delayed and limited.

OCLC has devoted, at various times, ten or more employees to respond to and mitigate the harm from these attacks from October 2022 to present.

OCLC Online Computer Library Center, Inc. v. Anna's Archive : Free Download, Borrow, and Streaming : Internet Archive

This item represents a case in PACER, the U.S. Government's website for federal case data. If you wish to see the entire case, please consult PACER directly.

Internet Archive

Show thread

ancuuiqter Feb 7, 2024

Here are the court filings if anyone would like to read them:

archive.org/details/gov.uscourts.ohsd.287709/

The following is a link to the docket (which the above link draws from), so people can follow the progress of the lawsuit:

courtlistener.com/…/oclc-online-computer-library-…

OCLC Online Computer Library Center, Inc. v. Anna's Archive : Free Download, Borrow, and Streaming : Internet Archive

This item represents a case in PACER, the U.S. Government's website for federal case data. If you wish to see the entire case, please consult PACER directly.

Internet Archive

ancuuiqter Feb 7, 2024

Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data * TorrentFreak

https://lemmy.world/post/11684441

Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data - Lemmy.World

> American nonprofit OCLC is known globally for its leading database of bibliographic records, WorldCat. A few months ago, many of these records were posted publicly by the shadow library search engine, Anna’s Archive. OCLC believes that this is the result of a year-long hack and, with a lawsuit filed at an Ohio federal court, it demands damages. > WorldCat Sues Anna’s Archive > > It is no secret that publishers fiercely oppose the search engine’s stated goals. The same also applies to OCLC, which has now elevated its concerns into a full-blown lawsuit, filed this month at a federal court in Ohio. > > The complaint accuses Washington citizen Maria Dolores Anasztasia Matienzo and several “John Does” of operating the search engine and scraping WorldCat data. The scraping is equated to a cyberattack by OCLC and started around the time Anna’s Archive launched. > > “Beginning in the fall of 2022, OCLC began experiencing cyberattacks on WorldCat.org [http://WorldCat.org] and OCLC’s servers that significantly affected the speed and operations of WorldCat.org [http://WorldCat.org], other OCLC products and services, and OCLC’s servers and network infrastructure,” OCLC’s complaint notes. > > “These attacks continued throughout the following year, forcing OCLC to devote significant time and resources toward non-routine network infrastructure enhancements, maintenance, and troubleshooting.” > > The non-profit says that it spent roughly $68 million over the past two years developing and enhancing WorldCat records, which are an essential part of its operation. Having a copy of the data publicly available through Anna’s Archive is a direct threat to its business. > > OCLC claims that Anna’s Archive unmasked itself as the “perpetrator of the attacks on WorldCat.org [http://WorldCat.org]” when it publicly announced its scraping effort. This includes a detailed blog post the operators published on the matter, encouraging the public to use the scraped data. > In addition to harvesting data from WorldCat.org [http://WorldCat.org], the defendants are also accused of obtaining and using credentials of a member library to access WorldCat Discovery Services. This opened the door to yet more detailed records that are not available on WorldCat.org [http://WorldCat.org]. > > OCLC says that it spent significant time and resources to address the ‘attacks’ on its systems. > > “These hacking attacks materially affected OCLC’s production systems and servers, requiring around-the-clock efforts from November 2022 to March 2023 to attempt to limit service outages and maintain the production systems’ performance for customers. > > “To respond to these ongoing attacks, OCLC spent over 1.4 million dollars on its systems’ infrastructure and devoted nearly 10,000 employee hours to the same,” the complaint adds.

ancuuiqter Nov 12, 2023

Z-Library Blog: "Unprecedented seizure of our domains with books on rare languages"

https://lemmy.world/post/8146944

Z-Library Blog: "Unprecedented seizure of our domains with books on rare languages" - Lemmy.World

> Today we are forced to share some sad news - yesterday many of our domains were seized again. We should highlight that the majority of the seized domains were not mirrors of the Z-Library website. Instead, they were separate sub-projects, containing only books in rare languages of the world, and their blocking is perplexing. For instance, these domains included books in Tamil, Mongolian, Catalan, Urdu, Pashto, and other languages: > > afrikaans-books.org [http://afrikaans-books.org] > > bengali-books.org [http://bengali-books.org] > > urdu-books.org [http://urdu-books.org] > > marathi-books.org [http://marathi-books.org] > > chamorro-books.org [http://chamorro-books.org] > > Over the 15 years of the project’s existence, we’ve managed to collect an impressive collection of rare texts in many uncommon languages. These domains featured many unique texts that can’t be found anywhere else, including rare books, documents, and manuscripts. All of this is a priceless heritage, contributing to the preservation and study of world cultures, and serving as important material for researchers in linguistics, anthropology, and history. Z-Library also states in the blog post that they did not lose the files, just the domains.

ancuuiqter Oct 10, 2023

2 companies caught illegally printing over 15,000 books, calendars in Ho Chi Minh City, Vietnam

https://lemmy.world/post/6609287

2 companies caught illegally printing over 15,000 books, calendars in Ho Chi Minh City, Vietnam - Lemmy.world

>Authorities have caught two Ho Chi Minh City-based firms printing more than 15,000 pirated copies of books and 2024 calendars, with a combined weight exceeding 15 metric tons. > >Police officers from the Ministry of Public Security and the city, in coordination with inspectors from the municipal Department of Information and Communications on Monday morning raided Kien A Packing Production and Trading Service Company in the city’s outlying Cu Chi District. > >The company was caught with 3,000 illegally printed copies of ‘Kinh Truong Tho diet toi’ (Long-life sutra destroys sins) from Ton giao (religion) Publishing House and 9,000 illegally printed copies of ‘Sherlock Holmes’ from the Writers’ Association Publishing House. The combined weight of the books was 10 metric tons. > >Authorities had not given their approval for the books to be printed. All of the illegally copies have since been seized.

Show thread

ancuuiqter Jul 11, 2023

Mentioning this since the project Anna’s Archive compiles several datasets and their corresponding torrents.

Anna’s Archive, whose aim is to “archive all the books in the world, and make them widely accessible,” pulls from a number of shadow library sources; the project provides its own torrent links (via Tor) for Library Genesis, Z-lib, Internet Archive, among others, plus Library Genesis’s torrents. In the datasets linked below, you can click on a given source and find its onion site or the torrents provided by the shadow library itself (in the case of Library Genesis, for example).

Anna’s Archive datasets

…almost all files shown on Anna’s Archive are available through torrents. Below is a list of the different data sources that we use, with links to their torrents. Our own torrents are available on Tor.

Sources include

Internet Archive Digital Lending Library
Libgen.li comics
Z-Library scrape
ISBNdb scrape
Libgen auxiliary data
Libgen.rs
Libgen.li (includes Sci-Hub)

Datasets - Anna’s Archive

The world’s largest open-source open-data library. Mirrors Sci-Hub, Library Genesis, Z-Library, and more.