The autowebcompat crawler is at https://github.com/marco-c/autowebcompat/blob/master/collect.py.
For this issue, we should:
- investigate whether it's feasible to write a single crawler that can be used for both projects (or does one project require something that really doesn't work for the other?);
- create a crawler library and make both this project and autowebcompat use it.
If we find that we can't share the entire crawler, we can probably at least share parts of it (we could create a crawler_utils library).
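As a starting point for discussion, here is a rough sketch of what the shared part could look like if only the URL loop and error handling turn out to be common. Everything here is hypothetical: `crawl`, `visit`, and `on_error` are illustrative names, not existing code in either project, and the real split may well look different once we compare the two crawlers.

```python
from typing import Any, Callable, Dict, Iterable, Optional


def crawl(
    urls: Iterable[str],
    visit: Callable[[str], Any],
    on_error: Optional[Callable[[str, Exception], None]] = None,
) -> Dict[str, Any]:
    """Hypothetical shared loop: run a project-specific `visit` on each URL.

    `visit` is where each project would plug in its own logic (e.g. driving
    a browser and collecting whatever data it needs). Failures on one URL
    are reported through `on_error` and do not stop the crawl.
    """
    results: Dict[str, Any] = {}
    for url in urls:
        try:
            results[url] = visit(url)
        except Exception as exc:  # keep crawling even if one site fails
            if on_error is not None:
                on_error(url, exc)
    return results
```

With this shape, each project would keep its own `visit` function and only the loop would live in the shared library. If the two projects need different browser setup or interaction steps, those would stay project-specific.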