Any sensible random sampling process is fine. If many people independently get similar results for the percentage of fake/spam/duplicate accounts, that will be telling.
I picked 100 as the sample size because that is what Twitter uses to calculate its <5% fake/spam/duplicate figure.
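For anyone who wants to try this, here is a rough sketch of one way to draw the sample and see how much a 100-account estimate can wobble. It is only an illustration, not Twitter's method: the function names `sample_accounts` and `estimate_share` are made up for this example, the account list is a stand-in, and deciding whether a given account is fake/spam/duplicate is left to whoever does the manual check. The Wilson score interval is my choice for the uncertainty range, not something stated above.

```python
import math
import random


def sample_accounts(account_ids, sample_size=100, seed=None):
    """Draw a simple random sample of accounts without replacement.

    `account_ids` is a hypothetical list of IDs (for example, followers of
    one account); how that list is obtained is outside this sketch.
    """
    rng = random.Random(seed)
    return rng.sample(list(account_ids), sample_size)


def estimate_share(flags, z=1.96):
    """Return (share, low, high) for a list of True/False review results.

    `share` is the observed fraction flagged as fake/spam/duplicate.
    `low` and `high` form a Wilson score interval (about 95% for z=1.96).
    """
    n = len(flags)
    p = sum(flags) / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return p, max(0.0, centre - half), min(1.0, centre + half)


if __name__ == "__main__":
    # Stand-in population of 10,000 account IDs (assumption for the demo).
    picked = sample_accounts(range(10_000), sample_size=100, seed=1)
    print(len(picked), "accounts to review by hand")

    # Suppose the manual review flagged 5 of the 100 sampled accounts.
    flags = [True] * 5 + [False] * 95
    share, low, high = estimate_share(flags)
    print(f"observed {share:.1%}, plausible range {low:.1%} to {high:.1%}")
```

With 5 flagged out of 100, the range from this calculation runs from roughly 2% to 11%, so a single sample of 100 is coarse. That is why many independent samples landing on similar numbers would be more convincing than any one of them.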