Revert "docs: Tranco list"

This reverts commit 47eb3ba29cd0fdb4ead7025c08a1fafa9bc4d31f.
667a55b51c
MDLeom 2020-04-02 22:51:12 +01:00
parent b1b7016675
commit b0cd7ea402
1 changed files with 1 additions and 1 deletions

2
FAQ.md

@ -1,7 +1,7 @@
- How is the filter created?
1. Grab the URLhaus **Database dump (CSV)** and save it to [URLhaus.csv](https://gitlab.com/curben/urlhaus-filter/blob/master/src/URLhaus.csv).
2. Extract the domains.
3. Exclude popular domains ([Tranco List](https://tranco-list.eu/)) and some other well-known domains not included in Tranco ([exclude.txt](https://gitlab.com/curben/urlhaus-filter/blob/master/src/exclude.txt)).
3. Exclude popular domains ([Umbrella Popularity List](https://s3-us-west-1.amazonaws.com/umbrella-static/index.html)) and some well-known domains (if not listed by Umbrella, see [exclude.txt](https://gitlab.com/curben/urlhaus-filter/blob/master/src/exclude.txt)).
4. Extract the URLs (from step 1) that include popular domains (Umbrella and exclude.txt).
5. Merge the files from step 3 and 4.
6. Lite version only parses online urls from that database.