Commit Graph

8 Commits

Author SHA1 Message Date
curben d4bef6923f fix: merge duplicate operations
remove simple comments
2019-05-28 12:32:08 +09:30
curben f9e1cb84ce fix: run dos2unix before text processing
rename urlhaus.txt in tmp/
2019-05-28 09:59:02 +09:30
curben 9a5fdb2be6 fix: use simple URL list
we no longer care the status of URL
bb817d9838
2019-05-27 15:59:08 +09:30
curben bb817d9838 fix: use all URLs including offline's
upstream (urlhaus.abuse.ch) incorrectly marks many online urls as offline.
noticed from 6c7faa95f7
2019-05-27 15:10:12 +09:30
curben a7046c77a6 refactor: move script executions from CI config to index.sh
easier to test locally
2019-05-27 15:01:57 +09:30
curben ea6c3f6796 refactor: remove '-e' parameter of sed
not necessary if there is only one script
https://unix.stackexchange.com/a/33159
2019-05-17 18:13:26 +09:30
curben 013267e310 perf: grep using urlhaus-top-domains.txt instead of much larger top-1m.txt 2019-05-12 12:48:13 +09:30
curben 5beecca906 feat: include full URL for popular domains 2019-05-11 18:49:25 +09:30