New AI tool identifies 1,000 'questionable' scientific journals

Ìę
A team of computer scientists led by the ÁńÁ«ÊÓÆ”18 has developed a new artificial intelligence platform that automatically seeks out âquestionableâ scientific journals.
The study, in the journal âScience Advances,â tackles an alarming trend in the world of research.
Daniel Acuña, lead author of the study and associate professor in the Department of Computer Science, gets a reminder of that several times a week in his email inbox: These spam messages come from people who purport to be editors at scientific journals, usually ones Acuña has never heard of, and offer to publish his papersâfor a hefty fee.
Such publications are sometimes referred to as âpredatoryâ journals. They target scientists, convincing them to pay hundreds or even thousands of dollars to publish their research without proper vetting.

Daniel Acuña
âThere has been a growing effort among scientists and organizations to vet these journals,â Acuña said. âBut itâs like whack-a-mole. You catch one, and then another appears, usually from the same company. They just create a new website and come up with a new name.â
His groupâs new AI tool automatically screens scientific journals, evaluating their websites and other online data for certain criteria: Do the journals have an editorial board featuring established researchers? Do their websites contain a lot of grammatical errors?
Acuña emphasizes that the tool isnât perfect. Ultimately, he thinks human experts, not machines, should make the final call on whether a journal is reputable.
But in an era when prominent figures are questioning the legitimacy of science, stopping the spread of questionable publications has become more important than ever before, he said.
âIn science, you donât start from scratch. You build on top of the research of others,â Acuña said. âSo if the foundation of that tower crumbles, then the entire thing collapses.â
The shake down
When scientists submit a new study to a reputable publication, that study usually undergoes a practice called peer review. Outside experts read the study and evaluate it for qualityâor, at least, thatâs the goal. Ìę
A growing number of companies have sought to circumvent that process to turn a profit. In 2009, Jeffrey Beall, a librarian at CU Denver, coined the phrase âpredatoryâ journals to describe these publications.
Often, they target researchers outside of the United States and Europe, such as in China, India and Iranâcountries where scientific institutions may be young, and the pressure and incentives for researchers to publish are high.
âThey will say, âIf you pay $500 or $1,000, we will review your paper,ââ Acuña said. âIn reality, they donât provide any service. They just take the PDF and post it on their website.â
A few different groups have sought to curb the practice. Among them is a nonprofit organization called the (DOAJ). Since 2003, volunteers at the DOAJ have flagged thousands of journals as suspicious based on six criteria. (Reputable publications, for example, tend to include a detailed description of their peer review policies on their websites.)
But keeping pace with the spread of those publications has been daunting for humans.
To speed up the process, Acuña and his colleagues turned to AI. The team trained its system using the DOAJâs data, then asked the AI to sift through a list of nearly 15,200 open-access journals on the internet.
Among those journals, the AI initially flagged more than 1,400 as potentially problematic.
Acuña and his colleagues asked human experts to review a subset of the suspicious journals. The AI made mistakes, according to the humans, flagging an estimated 350 publications as questionable when they were likely legitimate. That still left more than 1,000 journals that the researchers identified as questionable.
âI think this should be used as a helper to prescreen large numbers of journals,â he said. âBut human professionals should do the final analysis.â
A firewall for science
Acuña added that the researchers didn't want their system to be a "black box" like some other AI platforms.
âWith ChatGPT, for example, you often donât understand why itâs suggesting something,â Acuña said. âWe tried to make ours as interpretable as possible.â
The team discovered, for example, that questionable journals published an unusually high number of articles. They also included authors with a larger number of affiliations than more legitimate journals, and authors who cited their own research, rather than the research of other scientists, to an unusually high level.
ÌęÌęBeyond the story
Our research impact by the numbers:
- 45 U.S. patents issued for CU inventions through Venture Partners in 2023â24
- 35 startups launched based on university innovations in 2023â24
- $1.2 billion raised by companies built on CU Boulder innovations in 2022â24
The new AI system isnât publicly accessible, but the researchers hope to make it available to universities and publishing companies soon. Acuña sees the tool as one way that researchers can protect their fields from bad dataâwhat he calls a âfirewall for science.â
âAs a computer scientist, I often give the example of when a new smartphone comes out,â he said. âWe know the phone's software will have flaws, and we expect bug fixes to come in the future. We should probably do the same with science.â
Co-authors on the study included Han Zhuang at the Eastern Institute of Technology in China and Lizheng Liang at Syracuse University in the United States.
Ìę