feat: add AdGuard Home-specific blocklist

- AdGuard Home is still compatible with the URL-based and hosts-based blocklists
- https://gitlab.com/curben/urlhaus-filter/-/issues/22
MDLeom 2020-09-01 06:36:05 +00:00
parent bf007654a3
commit f8a22696ab
No known key found for this signature in database
GPG Key ID: 32D3E28E96A695E8
3 changed files with 6836 additions and 1 deletions


@@ -6,6 +6,7 @@ There are multiple formats available, refer to the appropriate section according
- uBlock Origin (uBO) -> [URL-based](#url-based) section (recommended)
- Pi-hole -> [Domain-based](#domain-based) or [Hosts-based](#hosts-based) section
- AdGuard Home -> [Domain-based (AdGuard Home)](#domain-based-adguard-home) or [Hosts-based](#hosts-based) section
- Hosts file -> [Hosts-based](#hosts-based) section
- Dnsmasq -> [Dnsmasq](#dnsmasq) section
- BIND -> [BIND](#bind) section
@@ -29,6 +30,8 @@ Import the following URL into uBO to subscribe:
- https://gitcdn.xyz/repo/curbengh/phishing-filter/master/dist/phishing-filter.txt
- https://cdn.jsdelivr.net/gh/curbengh/phishing-filter/dist/phishing-filter.txt
**Note:** AdGuard Home is not compatible with the URL filter (`||baddomain.com/bad/page$all`) of this blocklist, although it is still compatible with the domain filter (`||baddomain.com^`) also used in this blocklist. Consider using the [smaller blocklist](#domain-based-adguard-home) instead; it is roughly half the size.
</details>
## Domain-based
@@ -49,6 +52,24 @@ This blocklist includes domains and IP addresses.
</details>
## Domain-based (AdGuard Home)
This AdGuard Home-compatible blocklist includes domains and IP addresses.
- https://gitlab.com/curben/phishing-filter/raw/master/dist/phishing-filter-agh.txt
<details>
<summary>Mirrors</summary>
- https://cdn.statically.io/gl/curben/phishing-filter/master/dist/phishing-filter-agh.txt
- https://glcdn.githack.com/curben/phishing-filter/raw/master/dist/phishing-filter-agh.txt
- https://raw.githubusercontent.com/curbengh/phishing-filter/master/dist/phishing-filter-agh.txt
- https://cdn.statically.io/gh/curbengh/phishing-filter/master/dist/phishing-filter-agh.txt
- https://gitcdn.xyz/repo/curbengh/phishing-filter/master/dist/phishing-filter-agh.txt
- https://cdn.jsdelivr.net/gh/curbengh/phishing-filter/dist/phishing-filter-agh.txt
</details>
## Hosts-based
This blocklist includes domains only.

6806
dist/phishing-filter-agh.txt vendored Normal file

File diff suppressed because it is too large


@@ -18,6 +18,8 @@ fi
mkdir -p "tmp/"
cd "tmp/"
PHISHTANK_API="f6d24512707c96f8d01875ace77d52950a5b9157e1535ea70e7f704efb3e4066"
## Prepare datasets
curl -L "https://data.phishtank.com/data/$PHISHTANK_API/online-valid.csv.bz2" -o "phishtank.bz2"
curl -L "https://openphish.com/feed.txt" -o "openphish-raw.txt"
@@ -111,11 +113,17 @@ SIXTH_LINE="! Source: https://www.phishtank.com/ & https://openphish.com/"
COMMENT_UBO="$FIRST_LINE\n$SECOND_LINE\n$THIRD_LINE\n$FOURTH_LINE\n$FIFTH_LINE\n$SIXTH_LINE"
# Compatibility with Adguard Home
# https://gitlab.com/curben/urlhaus-filter/-/issues/19
# curben/urlhaus-filter#19
cat "phishing-notop-domains.txt" | \
sed "s/^/||/g" | \
sed "s/$/^/g" > "phishing-domains-adguard.txt"
# curben/urlhaus-filter#22
cat "phishing-domains-adguard.txt" | \
sort | \
sed '1 i\'"$COMMENT_UBO"'' | \
sed "1s/Blocklist/Blocklist (AdGuard Home)/" > "../dist/phishing-filter-agh.txt"
cat "phishing-domains-adguard.txt" "phishing-url-top-domains.txt" | \
sort | \
sed '1 i\'"$COMMENT_UBO"'' > "../dist/phishing-filter.txt"
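
The AdGuard Home steps in the hunk above (wrap each domain as `||domain^`, sort, prepend the comment header, tag the title) can be tried in isolation with the minimal sketch below. It is not the repository's script: the function names `to_agh_rules` and `build_agh_list`, the sample domains, and the two-line header standing in for `$COMMENT_UBO` are all assumptions made for illustration.

```shell
#!/bin/sh
# Sketch only: the function names, sample domains and header text below are
# hypothetical and are not taken from the repository's build script.

# Wrap every input line as an AdGuard-style domain rule:
# example.com -> ||example.com^
to_agh_rules() {
  sed -e 's/^/||/' -e 's/$/^/'
}

# Sort the rules read from stdin, put the header ($1) on top,
# and tag the title on the first line only.
build_agh_list() {
  header="$1"
  rules="$(sort)"
  printf '%s\n%s\n' "$header" "$rules" |
    sed '1s/Blocklist/Blocklist (AdGuard Home)/'
}

# Made-up header standing in for $COMMENT_UBO
SAMPLE_HEADER='! Title: Sample Blocklist
! Source: illustrative data only'

# Made-up input standing in for phishing-notop-domains.txt
printf '%s\n' "phish.example" "192.0.2.1" "login.example.org" |
  to_agh_rules |
  build_agh_list "$SAMPLE_HEADER"
```

One deliberate difference from the hunk: the sketch prepends the header with `printf` rather than the one-line `sed '1 i\...'` form, which is a GNU sed extension, so the sketch should also run where only a POSIX `sed` is available.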