> Curious how others are handling nondeterminism in their stack.
I have an acquaintance who has been writing about this topic, and I have been finding the discussion interesting (e.g. https://www.varungodbole.com/p/why-do-companies-struggle-shipping ).
Personally, I don’t believe that LLMs on their own will ever become reliable, predictable, and hallucination-free enough to depend on. What this means for software that must harness their new potential, yet must also be depended upon, is what we’re all grappling with at the moment.