

Yeah, people are frequently terrible at understanding context so it shouldn’t be surprising that a computer has difficulty too.
There are actually a lot of specialized applications of neural network based computing being used for science, but they don’t get the flashy headlines because they are a tool. Those projects use it to find things to focus on narrowing down what people should look into first for confirmation, like ancient settlement patterns, stars that might have planets, and other things where patterns exist but are hard to see.
Some examples are listed here at a high level. In all cases the ai leads to humans confirming and then working from there, it isn’t the end result on its own. https://medium.com/@jeyadev_needhi/uncovering-the-past-how-ai-is-transforming-archaeology-38ded420896d
Ewww, no. The programmer should have run their unit tests, maybe even told you about them. You should be testing for edge cases not covered by the unit tests at a minimum and replicating the unit tests if they don’t appear to be very thorough.