1) I believe so.

3) Nigh identical files (hacks, alternate dumps and such) have been batched together, so when compressed with 'solid' type options 7zip spots all the data repetition between the nigh identical files, and super compresses them.

4) Scan the set yourself. So you can generate some have.txt and miss.txt files...