Discussion about this post

User's avatar
Suhel's avatar

I also found Opus 4.8 hallucinating terribly.

I asked both Opus 4.8 and GPT 5.5 to search for specific listings and give me their review of findings. And Opus made conclusions and recommendations based on hallucinations and outdated information.

And when i followed up with this issue it tried to gaslight me by saying that GPT was incorrect and the original info it hallucinated really did exist. I pushed back a third time and it capitulated but claimed that the listing information is dynamic and these listings are published then taken down often so Opus was actually right overall but in this specific instance it looks I was correct as right now it's not live. 🫠

It's like working with a stubborn arrogant coworker who always has to be right and get the last word in. 😆

Its been a very long time since I've experienced anything this bad too.

Mark S. Carroll's avatar

What I appreciate here is that this does not just report on AI tools. It quietly shows the shift from tool use to workflow design.

The most interesting pattern across all three segments is not “AI can do impressive things.” It is that the real leverage starts showing up when people stop treating AI like a chatbot and start treating it more like an operator, a collaborator, or a system with boundaries, goals, and handoff points.

That is the bigger story to me. Not just better outputs. Better orchestration.

No posts

Ready for more?