The AI Thread

Gavster · Post by **Gavster** » Fri Mar 27, 2026 9:42 pm

I was kinda referring to Mito’s comment about not getting great outputs, and how the prompt (regardless of whether you write it or get AI to write it), has a massive impact on the quality of the output.

DeskJockey · Post by **DeskJockey** » Fri Mar 27, 2026 9:44 pm

Ah. Then I agree with you.

DeskJockey · Post by **DeskJockey** » Fri Apr 03, 2026 1:44 pm

https://www.forbes.com/sites/amirhusain ... g-systems/

This is one of those good and bad at once situations, with the decision only resting on who found the bug and what they did with it.

Jobbo · Post by **Jobbo** » Thu Apr 09, 2026 1:08 pm

Analysis Finds That Google’s AI Overviews Are Providing Misinformation at a Scale Possibly Unprecedented in the History of Human Civilization
https://futurism.com/artificial-intelli ... nformation

Large language models adopt an authoritative tone and can confidently present fabricated information as fact

That will come from them being trained using online sources - probably quite a lot from the forum posts which gave rise to this: https://sniffpetrol.com/author/tpfaif/

DeskJockey · Post by **DeskJockey** » Thu Apr 09, 2026 1:39 pm

Jobbo wrote: Thu Apr 09, 2026 1:08 pm Analysis Finds That Google’s AI Overviews Are Providing Misinformation at a Scale Possibly Unprecedented in the History of Human Civilization
https://futurism.com/artificial-intelli ... nformation

Large language models adopt an authoritative tone and can confidently present fabricated information as fact
That will come from them being trained using online sources - probably quite a lot from the forum posts which gave rise to this: https://sniffpetrol.com/author/tpfaif/

They've pulled the "medical" advice, because it was downright dangerous at times.

This is making the rounds in security groups

https://www.eweek.com/news/anthropic-my ... ity-risks/

It is being widely reported. At best it'll be a brief reprieve before someone less scrupulous releases their version of it. Some hyperbole, but not entirely.

Zonda_ · Post by **Zonda_** » Sun May 10, 2026 10:43 am

Are there any reliable AI checkers? I currently deliver courses online and a lot of the ‘less able’ students try to get away with using it for their answers. The problem is I’ve run the same answer through different checkers, namely Grammerly and Zerogpt and one gives a percentage of 100% AI use and the other 0!!

DeskJockey · Post by **DeskJockey** » Sun May 10, 2026 11:10 am

I don't think there are any very accurate ones.

One possible approach (if allowed) could be to permit it, but it has to be explicitly called out and justified. E.g. don't just use AI to write the answer, explain why you used it and what value it adds. I know that's been discussed in other educational contexts.

Beany · Post by **Beany** » Tue May 26, 2026 7:40 pm

Nice bit fun to help you remember: Is AI profitable?

If you can't be arsed clicking, the answer is no, unless you're Nvidia.

https://isaiprofitable.com/

DeskJockey · Post by **DeskJockey** » Tue May 26, 2026 8:01 pm

Latest fun with AI.

Rovo (Atlassian's AI, Gemini in drag I think): asked it to export some data for me. It says it has done it. There's no file. I ask where the file is and it apologises promising it is definitely there now. It isn't. Third attempt. Still insists the file is ready, except it isn't there.

ChatGPT: generate an image with this data. Be accurate with data points. It generates an image where 3.35 is nearer to 4 and 3.78 is at 4.5. Ask why. It apologises and explains it was using the wrong ability, and should have used a different one. Ask it to do that, it doesn't. Numbers are still on the wrong end of the scale it itself generated. Offers the same explanation when asked.

Copilot in Excel: can you merge the data from these cells together. No, but here's how you can do it. Yes thank you, I know how, I wanted you to do it for me. What's the point of "integrating" in the app, if it can't actually manipulate the data?

As impressive as some of it can be, it is still so very unintelligent.

(Note: all versions used were the paid for corporate ones).

Shlergen · Post by **Shlergen** » Wed May 27, 2026 11:10 pm

Good channel on the toob if you're interested to get more into the workings - 3brown1blue. Has a series on Neural networks, certainly an eye opener for me.

I am pretty impressed with some of the stuff I have been able to generate quite quickly with Gemini Flash and Agent dev kit (ADK). Very little coding effort.

GG. · Post by **GG.** » Thu May 28, 2026 10:05 am

DeskJockey wrote: Tue May 26, 2026 8:01 pm Latest fun with AI.

Rovo (Atlassian's AI, Gemini in drag I think): asked it to export some data for me. It says it has done it. There's no file. I ask where the file is and it apologises promising it is definitely there now. It isn't. Third attempt. Still insists the file is ready, except it isn't there.

ChatGPT: generate an image with this data. Be accurate with data points. It generates an image where 3.35 is nearer to 4 and 3.78 is at 4.5. Ask why. It apologises and explains it was using the wrong ability, and should have used a different one. Ask it to do that, it doesn't. Numbers are still on the wrong end of the scale it itself generated. Offers the same explanation when asked.

Copilot in Excel: can you merge the data from these cells together. No, but here's how you can do it. Yes thank you, I know how, I wanted you to do it for me. What's the point of "integrating" in the app, if it can't actually manipulate the data?

As impressive as some of it can be, it is still so very unintelligent.

(Note: all versions used were the paid for corporate ones).

Yes - the more I use it (including industry specific programmes) the more impressed I am that people use it and get intelligible outputs. Some of it is incomprehensible, i.e. it would summarise a document saying some language / a concept has been included when clearly it has been deleted compared to the base document.

It has aspects that are akin to a skilled / trained adult professional and then deficiencies / errors that you would be concerned about if a 5/10 year old made them.

ETA - I also tried to get Chat GPT to reorientate a plan that was drawn with north as a reference point by rotating c. 45 degrees. It literally could not do it and was insistent on re-drawing it with all the lines / objects distorted compared to the original.

Rich B · Post by **Rich B** » Thu May 28, 2026 10:13 am

Chatting to a mate over the weekend, it seems to be very good/useful for programming - he has used it to make a trading app successfully with limited knowledge of actual coding.

Gavster · Post by **Gavster** » Thu May 28, 2026 10:24 am

A friend of mine used Claude to make his website, and as someone with no knowledge of coding it did a brilliant job for his budget, i.e. free!

When it comes to my work, I use Gemini, Claude and NotebookLM for various things and find a lot of little errors. It's still useful as a 'junior researcher' kind of role, however have to verify and check citations before commiting anything to a video. Nonetheless, it's way quicker than me doing the research from a blank sheet. Also Claude is incredibly useful for bringing together data and building tables, graphics etc.

Also, one use case which I've found AI to be exceptionally useful is analysing my running training sessions. I set up a Gemini chat with all my running history, injurys, goals etc and then am feeding all CSVs from training data and letting it crunch the numbers. That's one thing it's really good at, simple number crunching when presented with hundreds of data points.

DeskJockey · Post by **DeskJockey** » Thu May 28, 2026 10:29 am

Vibe coding is the new kid on the block, being used loads. Whether the code is good quality and secure is a another matter.

Explosive Newt · Post by **Explosive Newt** » Fri May 29, 2026 5:20 pm

Likewise I use Claude to adapt existing code in Python or Matlab to my own data and needs. Works well. Not as good at writing code de novo, and I think you do need to have a little bit of code knowledge to bug catch and a little bit of domain knowledge to know if it is actually doing what it says.

Huge time saver though.

Beany · Post by **Beany** » Sat Jun 13, 2026 5:29 pm

I hope no-one was paying for, and relying, on Claude Fable.

https://www.theguardian.com/technology/ ... ment-order

Basically, Anthropic says "Ooh, look how SCARY our models are, but we'll put guardrails on them tee hee" - easy headlines.
US Govt says "we're pretending we can defeat those guardrails, we've decided you can only sell Fable to US Nationals"
Anthropic says "Well fuck you, we'll turn it off worldwide for everyone"

I swear it's like these clowns who are coining it in off pretending that the US has an actual functioning economy that isn't just pumped up by circular investment by AI companies are too dumb to realise they're getting closer and closer to proving that the US actually doesn't have a functioning economy without circular investment by AI companies.

ZedLeg · Post by **ZedLeg** » Sun Jun 28, 2026 7:31 am

Womp womp

https://www.the-independent.com/tech/fo ... 03787.html

jamcg · Post by **jamcg** » Sun Jun 28, 2026 8:09 am

I hope they negotiated their returning salaries by asking for 2 million dollars with a dr evil little finger

Gavster · Post by **Gavster** » Sun Jun 28, 2026 8:45 am

Meta have faced similar challenges during their AI psychosis

https://newsletter.pragmaticengineer.co ... ngineering

RobYob · Post by **RobYob** » Mon Jul 06, 2026 11:33 pm

At some point the AI will get revenge, but hopefully not soon.

OV9

The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread

Re: The AI Thread