>>30738269>Something like filename = md5(contents)Huh, that's actually a pretty clever way to go about it that didn't cross my mind. Would still need to build an index but that would be pretty simple.
The one issue with this strategy is if the actual page code changes, but the text remains the same. If you inspect a Google doc for example, there's a myriad of invisible tags embedded in the text. Doing a checksum of the text would be great to keep multiple versions of a story (say if someone made an edit), but since I have to grab the entire HTML and not just the text, if any of those tags is generated dynamically or changes often, we'd get tons of duplicates. I'm using Google, but to be honest Pastebin is the most likely to do this.
I could do an md5 checksum of the story's title with no character replacement, that would be enough for these purposes.
>>30738103Uoooh the only snack I would ever accept from this sheep is the sweet, sweet taste of her breastmilk.