Anna Alexandra Grigoryan – Medium

Anna Alexandra Grigoryan

When Benchmarks Lie: Why Contamination Breaks LLM Evaluation

Mar 30

Let’s Talk Jailbreaking: Siege and the Future of Multi-Turn LLM Exploitation

Beyond One-Shot Jailbreaking: Why Multi-Turn Attacks Matter

Mar 15

Atom of Thoughts (AOT) — A Markovian Take on LLM Reasoning

LLMs have seen major improvements with training-time scaling — increasing parameters and training data boosts performance. But once…

Mar 4

AdaptiveStep — Smarter Stepwise Reasoning in LLMs with Confidence-Based Division

One of the key challenges in deploying large language models (LLMs) for reasoning-intensive tasks — whether in mathematics, coding, or…

Mar 3

ARMAP: Scaling Autonomous Agents via Automatic Reward Modeling and Planning

LLMs struggle with multi-step decision-making and real-world interaction. The ARMAP framework introduced by Chen et al. (2025) and…

Feb 23

Structured Retrieval Orchestration: Why Multi-Agent Systems Need More Than RAG

The Limitation of RAG as an Isolated System

Feb 19

I Read The Chief AI Officer’s Handbook So You Don’t Have To — Here’s What Actually Matters

AI leadership is evolving, and The Chief AI Officer’s Handbook attempts to capture what it takes to lead AI initiatives successfully. But…

Feb 19

Why Trusting LLM Outputs in Production Can Be Misleading and How We Can Quantify It

Why Should We Care About LLM Uncertainty?

Feb 18

MARCO: Multi-Agent Real-time Chat Orchestration

Real-world deployment of multi-agent, LLM-based automation still faces major hurdles: inconsistencies, hallucinations, and inefficient…

Feb 17

The Internet of Agents (IoA): Protocol for Autonomous AI Collaboration

Why Multi-Agent Systems Need a Rethink

Feb 16

Anna Alexandra Grigoryan

red schrödinger’s cat thinking of doing something brilliant
