All People
US government has pit on hold plans to evaluate AI systems before their release. Cites competition with China www.nytimes.com/2026/05/21/t...
Just what I need, more whimsy from my google web search
“This flight will be full to Atlanta”
Thank god. I don’t want to be in the plane that only goes part way
I would have liked to see Sanderson’s Reckoners series as a TV series, but I’m good with this.
Musk loses court battle with OpenAI on the grounds that the statute of limitations had passed.
The 🍿 was good while it lasted
www.cnbc.com/2026/05/18/m...
Imagine getting upset over a movie that doesn’t involve Optimus Prime dying
Relative change in A grades given since the release of ChatGPT www.wsj.com/us-news/educ...
The ACL conference has put out a statement that papers with hallucinated references will be desk-rejected 2026.aclweb.org/acl_statement/
“by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated.”
This has always been the case and this shouldn’t even need to be stated. Yet here we are.
ArXiV has a new LLM policy
(Screenshots with alt text so you don’t have to click through to the other place and see all the stupid responses)
The inaugural ACM AI Leadership Summit will be held in Atlanta, August 30-September 2. aisummit26.acm.org
It convenes researchers, practitioners, industry leaders, educators, and policymakers to explore how AI can advance science and society.
We live in a sad world in which one cannot even trust their favorite poop analysis app to not sell their data to an AI company www.404media.co/ai-poop-anal...
How are universities feeling about the agreement that Instructure reached with cyberhacker including a pinkie-swear not to use the data exfiltration for further extortion of universities, students, and faculty?
The US Department of Labor has put out a new, free AI literacy course. Princeton CITP analyzed it blog.citp.princeton.edu/2026/05/05/m...
"We found that high-quality constitutional documents combined with fictional stories portraying an aligned AI can reduce agentic misalignment" www.anthropic.com/research/tea...
Who would have thought to use stories to align LMs? Oh, it was me in 2019... 1/
AAAI used a novel AI paper reviewing system on all 22k papers submitted. In phase 1, authors received 1 clearly marked AI generated review and 1 human review. arxiv.org/abs/2604.13940
It's going to be a pin, or a pen, or earbuds, or a phone...
On this May the Fourth, let us step back for a moment to think about how, very soon, "The Mandalorian & Grogu" will supplant "Attack of the Clones" for the Star Wars movie with the cringiest title.
That viral paper on the benefits of ChatGPT in education was using unsound meta-review methodologies. This does not mean that there are no benefits or anti-benefits of AI, only that the conclusions drawn in the paper cannot be drawn www.404media.co/nature-retra...
I wrote a reference checker to see if papers I am reviewing have hallucinated references.
It's a ghastly problem. PDF-to-structured-text is still an open problem. Reference formats can vary and some are hard to parse. Even when references are correct, there can be sloppiness.
Not too long ago someone I follow introduced a citation checking tool. I cannot find it anymore (and cannot search posts from only people I follow). Can anyone point me in the right direction?
Thanks!
Okay y'all, it's official.
See you in Atlanta in December.
As someone obsessed with whether AI can play D&D, I am going to have fun with the ChatGPT goblin phenomenon for a bit. You can ignore me.
But if you do want to take it seriously, here is a paper my team and I wrote about why AI playing D&D should be an AI grand challenge: arxiv.org/abs/2509.17192
Designating that most uses of the term "goblin" and "gremlin" are not legitimate while designating most uses of the word "frog" as legitimate is the imposition of a set of values on a technology. Why do a small number of people in Silicon Valley get to decide whether goblins are inappropriate?
Reward your model for being nerdy, get nerdy behavior.
Train in your own output, pollute your dataset with verbal tics
openai.com/index/where-...
Congratulations to Dr. Gennie Mansi, for successfully defending her PhD thesis.
Dr. Mansi's work investigates AI in healthcare; how AI impacts legal liability of doctors, how AI design can interfere with delivery of care, and also we might improve the design of AI systems to account for liability
*cocks gun, steps back to the terminal*
Computer’s got goblins
M
Recent Activity44 posts
Recent Activity
Mark Riedl
@markriedl.bsky.social
SafetyEvaluation
Mark Riedl
@markriedl.bsky.social
Mark Riedl
@markriedl.bsky.social
Mark Riedl
@markriedl.bsky.social
Mark Riedl
@markriedl.bsky.social
Mark Riedl
@markriedl.bsky.social
Mark Riedl
@markriedl.bsky.social
Evaluation
Mark Riedl
@markriedl.bsky.social
Evaluation
Mark Riedl
@markriedl.bsky.social
Safety
Mark Riedl
@markriedl.bsky.social
Evaluation
Mark Riedl
@markriedl.bsky.social
Mark Riedl
@markriedl.bsky.social
SafetyDeployment
Mark Riedl
@markriedl.bsky.social
Safety
Mark Riedl
@markriedl.bsky.social
Safety
Mark Riedl
@markriedl.bsky.social
SafetyAgent
Mark Riedl
@markriedl.bsky.social
EvaluationLLM
Mark Riedl
@markriedl.bsky.social
Deployment
Mark Riedl
@markriedl.bsky.social
Mark Riedl
@markriedl.bsky.social
Evaluation
Mark Riedl
@markriedl.bsky.social
EvaluationTooling
Mark Riedl
@markriedl.bsky.social
Tooling
Mark Riedl
@markriedl.bsky.social
Mark Riedl
@markriedl.bsky.social
EvaluationLLM
Mark Riedl
@markriedl.bsky.social
EvaluationSafety
Mark Riedl
@markriedl.bsky.social
Fine-tuning
Mark Riedl
@markriedl.bsky.social
SafetyTooling
Mark Riedl
@markriedl.bsky.social
44 posts · All time