That’s the reason for the maze. These companies have multiple IP addresses and bots that communicate with each other.
They can go through multiple entries in the robot.txt file. Once they learn they are banned, they go scrape the old fashioned way with another IP address.
But if you create a maze, they just continually scrape useless data, rather than scraping data you don’t want them to get.
Banning IP ranges isn’t going to work. A lot of these companies rent out home IP addresses.
Also the point isn’t just protecting content, it’s data poisoning.