Elsevier
Elsevier
No point discussing this if neither of us is going to prove it one way or the other.
Bitmaps are actually a key part of what I was thinking about, so you agree with me there it seems. There’s also the issue of using the wrong paper size. .IIRC Windows usually defaults to Letter for printing even in places where A4 is the only common size and no one has heard of Letter, and most people don’t realise their prints are cropped/resized. This would still apply when printing to PDF.
You’re pushing it through one system that converts a PDF file into printer instructions, and then through another system that converts printer instructions into a PDF file. Each step probably has to make adjustments with the data it’s pushing through.
Without looking deeply into the systems involved, I have to assume it’s not a lossless process.
They maintain a high quality but not lossless.
As a trivial example, if you use the wrong paper size (like Letter instead of A4) then it might crop parts of the page or add borders or resize everything. Again I’ll admit, in 99% of cases it doesn’t matter, but it might matter if, say, an embedded picture was meant to be exactly to scale.
Those printer instructions are called Postscript and they’re the basis of PDF.
You are thinking that the printing process will rasterize the PDF and then essentially OCR/vector map it back. It’s (usually) not that complicated.
I don’t understand the “that’s no how PDFs work” criticism.
Removing data from the original file is the whole point of the exercise! Of course unique tokens can be hidden in plain sight in images, letter spacing, etc. If we want to make sure to remove that we need to degrade the quality of the PDF so that this information is lost in said lossy conversion.
I feel like most of the academia in the research side would be happy to see it collapse, but the current system is too deeply tied in the money for any quick change
I worked in academia for almost a decade and never met a researcher who wouldn’t openly support sci-hub (well, some warned their students that it was illegal to type these spesific search terms and click on the wrong link downloading the pdf for free)
Yep. But that is all a part of the problem. If academics can’t organise themselves enough to have some influence over something which is basically owned and run them already (they write the papers and then review the papers and then are the ones reading and citing the papers and caring the most about the quality and popularity of the papers) … then they can’t be trusted to ensure the quality of their practice and institutions going forward, especially under the ever increasing encroachment of capitalistic forces.
Modern day academics are damn well lucky that they inherited a system and culture that developed some old aristocratic ideals into a set of conventions and practices!
It’s chicken/egg or “you first” problem.
You spend on your work. You probably have loans. Your income is pitiful. And this is the structural thing that gets you out. Now someone says “hey take a risk, don’t do it and break the system.”
Well…you first 🤷♂️
There are a couple things we can do:
Fully agree but I can tell you about point 1 that there enough gullible scientists in the world that see nothing wrong with the current system.
They will gadly pick up free review when Nature comes knocking, since its “such an honour” for such a reputable paper.
This is probably the avenue to shut this down. If funding is contingent on making the publication freely available to download, and that comes from a major government funding source, then this whole scam could die essentially overnight.
That would need to somehow get enough political support to pass muster in the first place and pass the inevitable legal challenge that follows, too. So, really, this is just another example of regulatory capture ruining everything.
i hear you, but this leaves this massive gaping hole very quickly filled by predatory journals
the better solution would be journals created and maintained by universities or other institutions with national (or international, like from EU) funding
I’m sympathetic, but to a limit.
There are a lot of academics out there with a good amount of clout and who are relatively safe. I don’t think I’ve heard of anything remotely worthy on these topics from any researcher with clout, publicly at least. Even privately (I used to be in academia), my feeling was most don’t even know how to think and talk about it, in large part because I don’t think they do think and talk about it all.
And that’s because most academics are frankly shit at thinking and engaging on collective and systematic issues. Many just do not want to, and instead want to embrace the whole “I live and work in an ideal white tower disconnected from society because what I do is bigger than society”. Many get their dopamine kicks from the publication system and don’t think about how that’s not a good thing. Seriously, they don’t deserve as much sympathy as you might think … academia can be a surprisingly childish place. That the publication system came to be at all is proof of that frankly, where they were all duped by someone feeding them ego-dopamine hits. It’s honestly kinda sad.
I’m sympathetic but to a limit
That’s all I’m saying 🤷♂️
Publishing comes with IP laws and copyright. For example, open access articles should be easy to upload without concern. “Private” articles being republished somewhere without license is “piracy”, and ResearchGate did get in trouble for it. It’s complicated. www.chemistryworld.com/news/…/4018095.article
Pre-prints are a different story.
I kind of assume this with any digital media. Games, music, ebooks, stock videos, whatever - embedding a tiny unique ID is very easy and can allow publishers to track down leakers/pirates.
Honestly, even though as a consumer I don’t like it, I don’t mind it that much. Doesn’t seem right to take the extreme position of “publishers should not be allowed to have ANY way of finding out who is leaking things”. There needs to be a balance.
Online phone-home DRM is a huge fuck no, but a benign little piece of metadata that doesn’t interact with anything and can’t be used to spy on me? Whatever, I can accept it.
That’s true. I was actually thinking/talking about this practice in general, not specifically with regards to Elsevier.
I definitely agree that scientific journals as they are today are unacceptable.
It can be used to spy on any decent scientist who will send papers his/hers/theirs institution has access to, but their friend doesn’t.
By “spy” I mean things like: know how many times I’ve read the PDF, when I’ve opened it, which parts of it I’ve read most, what program I used to open the PDF, how many copies of the PDF I’ve made, how many people I’ve emailed it to, etc. etc. etc.
This technique can do none of that. The only thing it can do is: if someone uploads the PDF to a mass sharing network, and an employee of the publisher downloads it from that mass sharing network and compares this metadata with the internal database, then they can see which of their users uploaded it and when they originally downloaded the PDF. It tells them nothing about how it got there. Maybe the original user shared it with 20 of their colleagues (a legitimate use of a downloaded PDF), and one of those colleagues uploaded that file to the mass sharing site without telling the original downloader. It doesn’t prove one way or the other. It’s an extremely small amount of information that’s only useful for catching systemic uploaders, e.g. a single user who has uploaded hundreds or thousands of PDFs that they downloaded from the publisher using the same account.
And a savvy user can always strip that metadata out.
As a reminder, …
All true, and fucked up, but it’s not related to what I was talking about. I was talking about the general use of this technique.
Doesn’t seem right to take the extreme position of “publishers should not be allowed to have ANY way of finding out who is leaking things”. There needs to be a balance.
Nah, fuck that; that’s both the opposite of an extreme position and is exactly the one we should take!
Copyright itself is a privilege and only exists in the first place “to promote the progress of science and the useful arts.” Any entity that doesn’t respect that purpose doesn’t deserve to benefit from it at all.