Naomi Saphra

ML/NLP professor

Recent Activity: 16 posts · 62 x-posts

Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Naomi Saphra humorously comments on a space for scientific discourse, offering to share images of herself.

Worth reading: Shows her playful side and engagement with the scientific community.

LLM
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU has SCHEMES for LM interpretability & analysis, so I couldn't be more pumped to join a

Highlights: Naomi Saphra announces her upcoming faculty position at Boston University in 2026, excited about their work on language model interpretability.

Worth reading: Highlights her career move and research focus on LM interpretability.

LLM · Evaluation
Naomi Saphra

@NaomiSaphra

New preprint! Phase transitions! We love to see them during LM training.

Highlights: Announces a new preprint about phase transitions in language model training.

Worth reading: Relevant for researchers interested in training dynamics and phase transitions in LLMs.

LLM · Fine-tuning
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU has SCHEMES for LM interpretability & analysis, so I couldn't be more pumped to join a

Highlights: Announces new faculty position at Boston University focusing on LM interpretability.

Worth reading: Highlights career move and BU's research initiatives in interpretability.

Evaluation · Safety
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Naomi Saphra announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights a key development in causal interpretability research.

Evaluation
Naomi Saphra

@NaomiSaphra

Waiting on a robot body. All opinions are universal and held by both employers and family. Now a dedicated grok hate account. Accepting ML/NLP PhD students.

Highlights: Bio/profile text noting that Saphra is accepting ML/NLP PhD students and describing the account as a "dedicated grok hate account".

Worth reading: Provides context on Saphra's professional status and interests.

LLM
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU has SCHEMES for LM interpretability & analysis, so I couldn't be more pumped to join a

Highlights: Announcement of starting as faculty at Boston University in 2026, focusing on LM interpretability.

Worth reading: Important career update and indication of research focus.

Evaluation
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Saphra jokes sarcastically about a space for scientific discourse, offering to open with images of herself.

Worth reading: Shows Saphra's playful side and engagement with scientific community.

Evaluation
Naomi Saphra

@NaomiSaphra

Perfect cute light very short read for a break in a deadline crunch.

Highlights: Saphra recommends a short, light read for a break during intense work.

Worth reading: Provides insight into Saphra's reading recommendations and work-life balance.

LLM
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Sarcastic comment about using images of oneself for scientific discourse.

Worth reading: Shows Saphra's humorous take on social media as a platform for science.

Safety
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Announces starting as faculty at Boston University in 2026.

Worth reading: Key career milestone for a prominent AI interpretability researcher.

Evaluation
nsaphra.bsky.social
Naomi Saphra

@nsaphra.bsky.social

I will ALWAYS read the youtube comments
May 18, 01:39 AM·❤️ 18🔄 1·💬 1
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Relevant for understanding recent advances in mechanistic interpretability of language models.

Evaluation · Safety
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Naomi Saphra humorously comments on a space for scientific discourse, starting with images of herself.

Worth reading: Shows her playful engagement with the scientific community on X.

LLM
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Naomi Saphra announces her upcoming faculty position at Boston University in 2026.

Worth reading: Highlights a major career milestone for a prominent AI researcher.

LLM
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Naomi Saphra humorously comments on using images of herself for scientific discourse.

Worth reading: Shows her playful engagement with the platform.

LLM
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Naomi Saphra announces her new faculty position at Boston University starting in 2026.

Worth reading: Key career milestone for a prominent AI researcher.

LLM
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Naomi Saphra humorously comments on using images of herself in a scientific discourse space.

Worth reading: Shows her playful engagement with online scientific communities.

LLM
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Announces her upcoming faculty position at Boston University in 2026.

Worth reading: Key career milestone for a prominent AI researcher.

LLM
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Naomi Saphra announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights a new contribution to causal interpretability in AI, a key area for understanding model behavior.

Evaluation
Naomi Saphra

@NaomiSaphra

RT @natolambert: A few facts, while the dust is settling. Ai2 still is... - releasing open models, folks want to,

Highlights: Naomi Saphra retweets a post about AI2 continuing to release open models despite industry changes.

Worth reading: Shows engagement with open model releases and industry dynamics.

Infra
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the ...

Highlights: Describes her research focus on training dynamics and mechanistic interpretability in NLP models.

Worth reading: Summarizes her research interests.

LLM · Evaluation
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Sarcastic comment about a space for scientific discourse.

Worth reading: Shows her critical perspective on AI discourse.

Safety
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Naomi Saphra announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights a key advancement in AI interpretability research.

Evaluation
Naomi Saphra

@NaomiSaphra

Ok, I wrote this up (link below)

Highlights: Naomi Saphra shares that she has written up something, presumably a blog post or paper.

Worth reading: Indicates a new written work by the researcher.

LLM
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights a new contribution to AI interpretability research.

LLM · Evaluation
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Naomi Saphra announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights ongoing work in AI interpretability, a key area for understanding and trusting AI models.

Safety · Evaluation
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Naomi Saphra humorously comments on using self-images to initiate scientific discourse.

Worth reading: Shows her playful engagement with the X platform.

Safety
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Naomi Saphra announces her new faculty position at Boston University starting in 2026.

Worth reading: Highlights her career move into academia.

LLM
nsaphra.bsky.social
Naomi Saphra

@nsaphra.bsky.social

May 8, 02:12 AM·❤️ 3🔄 0
Evaluation · Safety
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Sarcastic comment about scientific discourse on social media.

Worth reading: Reflects her critical perspective on online discussions.

Safety
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Describes her research focus on NLP model training and emergent behaviors.

Worth reading: Summarizes her research interests in interpretability.

LLM · Fine-tuning
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Relevant for researchers interested in mechanistic interpretability and causal methods in AI.

Evaluation
Naomi Saphra

@NaomiSaphra

Ok, I wrote this up (link below)

Highlights: Indicates a write-up on a topic, with a link to further content.

Worth reading: May contain insights on AI training or interpretability, given Saphra's research focus.

LLM
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Describes research focus on NLP model training and emergence of mechanistic behaviors.

Worth reading: Provides context on Saphra's research interests in mechanistic interpretability and training dynamics.

LLM · Evaluation
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Naomi Saphra describes her research focus on understanding and improving NLP model training, specifically how structures and mechanistic behaviors emerge.

Worth reading: Provides insight into the author's research interests in mechanistic interpretability and training dynamics.

LLM · Fine-tuning · Evaluation
Naomi Saphra

@NaomiSaphra

Naomi Saphra (@nsaphra). 237 likes. New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights a new contribution to causal interpretability, a key area in AI safety and mechanistic understanding.

Safety · Evaluation · LLM
Naomi Saphra

@NaomiSaphra

Just got a desk reject, post-rebuttals, for a paper being submitted to arxiv <30 min late for

Highlights: Naomi Saphra shares an experience of receiving a desk reject after rebuttals due to a paper being submitted to arXiv less than 30 minutes late.

Worth reading: Illustrates the strictness of conference deadlines and the challenges in academic publishing.

Evaluation · Fine-tuning
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Sarcastic comment about using images of herself for scientific discourse.

Worth reading: Shows her humorous take on online scientific discussions.

Evaluation
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights a contribution to mechanistic interpretability, a key area in AI safety.

Safety
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Describes research focus on training dynamics and emergence of structures in NLP models.

Worth reading: Provides insight into the researcher's expertise in mechanistic interpretability and training.

LLM · Fine-tuning
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Naomi Saphra describes her research focus on understanding and improving NLP model training, particularly how structures and mechanistic behaviors emerge.

Worth reading: Provides insight into the research interests of a prominent NLP/ML researcher.

Fine-tuning
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights a new contribution to causal interpretability in ML.

Evaluation
Naomi Saphra

@NaomiSaphra

This book starts like it's gonna be a fun microhistory of TB (it gave us the Stetson!

Highlights: Naomi Saphra comments on a book about tuberculosis, noting its engaging start.

Worth reading: Shows a personal interest outside of AI/ML.

Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Sarcastic comment about using images for scientific discourse.

Worth reading: Illustrates her humorous take on online discussions.

LLM
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Describes her research focus on NLP model training and emergent behaviors.

Worth reading: Summarizes her research interests.

LLM · Fine-tuning
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Sarcastic comment about using images of oneself for scientific discourse.

Worth reading: Shows her humorous take on online scientific communication.

Safety
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Announcement of starting as faculty at Boston University in 2026.

Worth reading: Highlights her career move into academia.

LLM
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Sarcastic comment about using images for scientific discourse.

Worth reading: Shows her humorous take on academic discussions.

Safety
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Announcement of new faculty position at Boston University.

Worth reading: Highlights her career move and impact on AI research.

LLM
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Describes her research focus on NLP model training and mechanistic behaviors.

Worth reading: Summarizes her research interests in AI interpretability.

LLM · Fine-tuning
Naomi Saphra

@NaomiSaphra

what a perfect space for scientific discourse! I'll start off with a few images of myself

Highlights: Sarcastic comment about using images in a scientific discourse space.

Worth reading: Shows her humorous take on academic communication.

LLM
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Announcement of joining Boston University as faculty in 2026.

Worth reading: Key career milestone for a prominent AI researcher.

LLM
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Describes her research focus on NLP model training and emergent behaviors.

Worth reading: Summarizes her research interests in mechanistic interpretability.

LLM · Evaluation
Naomi Saphra

@NaomiSaphra

This book starts like it's gonna be a fun microhistory of TB (it gave us the Stetson!

Highlights: Naomi Saphra comments on a book about tuberculosis, noting its engaging start.

Worth reading: Shows her casual reading interests.

Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Announces a new preprint on causal interpretability, emphasizing its coherence and testability.

Worth reading: Highlights her research focus on causal interpretability.

Evaluation
Naomi Saphra

@NaomiSaphra

New preprint! Phase transitions! We love to see them during LM training.

Highlights: Announces a new preprint about phase transitions in language model training.

Worth reading: Relevant to understanding training dynamics of LLMs.

LLM
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Naomi Saphra announces starting as faculty at Boston University in 2026.

Worth reading: Shows career move and continued involvement in academia.

LLM · Evaluation
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Naomi Saphra announces she will join Boston University as faculty in 2026.

Worth reading: Shows career move of a prominent ML/NLP researcher.

LLM · Fine-tuning
Naomi Saphra

@NaomiSaphra

New preprint! Everyone loves causal interp. It's coherently defined! It makes testable predictions

Highlights: Naomi announces a new preprint on causal interpretability, emphasizing its coherent definition and testable predictions.

Worth reading: Highlights a new contribution to mechanistic interpretability, a key area in AI safety.

Safety · Evaluation
Naomi Saphra

@NaomiSaphra

Life update: I'm starting as faculty at Boston University in 2026! BU ...

Highlights: Naomi Saphra announces she will join Boston University as faculty in 2026.

Worth reading: Shows a career milestone for a prominent AI researcher.

LLM
Naomi Saphra

@NaomiSaphra

Ok, I wrote this up (link below)

Highlights: Naomi references a write-up she authored, likely a blog post or paper.

Worth reading: Indicates a new piece of writing, possibly expanding on her research.

Evaluation
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Naomi Saphra describes her research on understanding and improving NLP model training, focusing on emergent structures and mechanistic behaviors.

Worth reading: Provides insight into her research focus on mechanistic interpretability in language models.

LLM · Safety
Naomi Saphra

@NaomiSaphra

I work on understanding and improving training for NLP models, with a focus on studying how structures and mechanistic behaviors emerge over the

Highlights: Naomi describes her research focus on training dynamics and emergence of mechanistic behaviors in NLP models.

Worth reading: Provides context on her research agenda in mechanistic interpretability and training dynamics.

LLM · Evaluation
16 posts · 62 x-posts · All time