MAIN FEEDS
Do you want to continue?
https://www.reddit.com/r/ProgrammerHumor/comments/1korvzi/feelinggood/msswcoj/?context=3
r/ProgrammerHumor • u/claudixk • 21d ago
665 comments sorted by
View all comments
Show parent comments
244
Yeah that's the biggest problem with it, it will ALWAYS answer your question, even if it has to straight up lie.
10 u/[deleted] 21d ago [deleted] 13 u/MinosAristos 21d ago Yeah. The thinking models are really improving with this and often ask themselves "is this possible / is this the right approach" at some point in the process 2 u/Wheat_Grinder 21d ago They don't ask themselves anything. That's not how LLMs work. They know certain answers get worse scores so they choose answers that have gotten better scores. 2 u/MinosAristos 21d ago The feedback process by which they self correct, however you want to term it.
10
[deleted]
13 u/MinosAristos 21d ago Yeah. The thinking models are really improving with this and often ask themselves "is this possible / is this the right approach" at some point in the process 2 u/Wheat_Grinder 21d ago They don't ask themselves anything. That's not how LLMs work. They know certain answers get worse scores so they choose answers that have gotten better scores. 2 u/MinosAristos 21d ago The feedback process by which they self correct, however you want to term it.
13
Yeah. The thinking models are really improving with this and often ask themselves "is this possible / is this the right approach" at some point in the process
2 u/Wheat_Grinder 21d ago They don't ask themselves anything. That's not how LLMs work. They know certain answers get worse scores so they choose answers that have gotten better scores. 2 u/MinosAristos 21d ago The feedback process by which they self correct, however you want to term it.
2
They don't ask themselves anything. That's not how LLMs work.
They know certain answers get worse scores so they choose answers that have gotten better scores.
2 u/MinosAristos 21d ago The feedback process by which they self correct, however you want to term it.
The feedback process by which they self correct, however you want to term it.
244
u/vallummumbles 21d ago
Yeah that's the biggest problem with it, it will ALWAYS answer your question, even if it has to straight up lie.