ArXiv to Ban Authors of AI-Generated Papers: Legal Implications for Academic Authorship and Copyright Law

But the world’s largest preprint server had just had enough. ArXiv has been getting an overwhelming number of papers, which some researchers are calling AI slop, for months. Now they are defending themselves by penalising in a big way. Any researcher found to have artificially generated text injected into a paper is penalized by a one-year ban. No warnings. No second chances.

Over there Thomas Dietterich is in charge of the computer science section. He has recently been seen posting on social media and has declared the law. The message was very tough. Somebody signs their name on a sheet of paper, that person is guilty for each and every word written on that paper. It could be written by humans or generated by a machine. ArXiv receives tens of thousands of submissions a month. It’s where physics and math and computer science breakthroughs become public really before they get peer reviewed. However, as the number of low-effort garbage created by large language models grew, they had no choice but to do so. A one-strike policy now has been implemented. The fact that one gets banned from arXiv for a year is a huge blow to any academic career. It essentially isolates the researcher from the world’s fastest science discussion

https://www.nature.com/articles/s41599-025-05924-3

The Signs of Lazy Science

The weird part is that ArXiv isn’t even attempting to create some high-tech tool to catch people using AI. Those tools are not effective, at best. Are too critical and correct the errors made by other honest writers. Rather, the moderators are only interested in the most awkward and glaring blunders.

Almost users are actually submitting papers with chatbot meta-comments in the middle of the text. Let’s say you read a dense paper on quantum mechanics, then you come across a sentence with the text: “Here’s a 200-word summary; would you like me to tweak it?” Or a data table with the words, This data is for illustration purposes only; please complete with actual data from your experiment. It’s just plain laziness. Dietterich described it as “indisputable evidence

Then, of course, there are the bogus citations. Things get really messy here. AI bots like to hallucinate. Ask a language model to write a lit review, and it’ll just make up academic papers that never existed. It will fill in the titles (Complete it). It will create fake authors. It will even create bogus publish dates. Columbia University researchers recently conducted a study published in The Lancet that reviewed millions of biomedical papers. They discovered that the number of fake citations has gone up 12 times since 2023. By early 2026, one in every 277 papers contained at least one completely fake reference. Any researcher who fails to look at these references submits a paper on air. This is the real reason behind the ban. The ban hammer falls right away if a moderator sees a hallucinated citation or a bot prompt that was not responded to.

Taking The Blame For The Bot

All this legal tangle comes into the light. What is the nature of the author today?

Academic authorship was the poetic construct of doing the job for centuries. You performed the experiment. The analysis was written by you. The results are next to you. The legal definition of authorship begins to bend however, when an algorithm writes half the methodology section. The rules of the institutions are very strict at present. You can’t make an artificial intelligence an author. It can’t accept a legal liability. It doesn’t comprehend morality.

So the human absorbs all the burden. However, if the text is plagiarized by this AI, the human researcher is legally accountable for plagiarism. When a pharmaceutical company uses fake data and the human author is the one that is plunging into professional disaster and possible legal troubles, it is the human author who is suffering. The only thing ArXiv is reminding researchers of is that signing a paper is a legal statement of responsibility. Don’t blame the software if it doesn’t work.

The Copyright Nightmare

IP laws are even worse. The game is playing out with a lot of concern for legal scholars.

The basic principles of copyright are that it must be the product of human authorship. IPRs are not assigned to machines. But when a large portion of the research paper is created by AI, can the researcher claim copyright for that text? Probably not. That’s a huge issue for future academic journals. If a researcher assigns the copyright to a publisher, he or she is required to ensure that the copyright is in his or her name. If they were using a bot to write it, their contract may be completely null and void.

There’s also the issue of from where the AI is getting information. LLMs are trained on millions of copyrighted books and articles. At times they retrudge it to you almost verbatim. A researcher may believe that he’s simply training the AI to repair his grammar. In fact, the bot could be copying heavily copyrighted material directly into their submission. The original content owner won’t sue the chatbot if he or she knows. They are going to file a lawsuit against the individual whose name is the first one in the PDF.

A Slower Pipeline

The new rules are the tough rules because they are tough. In March 2026, arXiv for the first time reaches the 30,000 submissions per month milestone. The volunteers are totally fatigued. They can’t lie awake at night worrying whether a paper was real science or a slick hallucination.

If a researcher does receive the one-year ban, then it does not simply expire after a year. They’re not simply restored to their old account. New papers they wish to submit to arXiv must be accepted by a respectable peer-reviewed journal first. They don’t get the same right to be published immediately.

Author

Leave a Reply

Your email address will not be published. Required fields are marked *