From b0cd7ea402d7e5e1bb14486d87fb5a197feb114f Mon Sep 17 00:00:00 2001 From: MDLeom <2809763-curben@users.noreply.gitlab.com> Date: Thu, 2 Apr 2020 22:51:12 +0100 Subject: [PATCH] Revert "docs: Tranco list" This reverts commit 47eb3ba29cd0fdb4ead7025c08a1fafa9bc4d31f. https://gitlab.com/curben/urlhaus-filter/-/commit/667a55b51cce96863c327ef0978c5af5ff8fbb4a --- FAQ.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/FAQ.md b/FAQ.md index a44a87e..151bf85 100644 --- a/FAQ.md +++ b/FAQ.md @@ -1,7 +1,7 @@ - How is the filter created? 1. Grab the URLhaus **Database dump (CSV)** and save it to [URLhaus.csv](https://gitlab.com/curben/urlhaus-filter/blob/master/src/URLhaus.csv). 2. Extract the domains. - 3. Exclude popular domains ([Tranco List](https://tranco-list.eu/)) and some other well-known domains not included in Tranco ([exclude.txt](https://gitlab.com/curben/urlhaus-filter/blob/master/src/exclude.txt)). + 3. Exclude popular domains ([Umbrella Popularity List](https://s3-us-west-1.amazonaws.com/umbrella-static/index.html)) and some well-known domains (if not listed by Umbrella, see [exclude.txt](https://gitlab.com/curben/urlhaus-filter/blob/master/src/exclude.txt)). 4. Extract the URLs (from step 1) that include popular domains (Umbrella and exclude.txt). 5. Merge the files from step 3 and 4. 6. Lite version only parses online urls from that database.