]>
Sergey Matveev's repositories - feeder.git/log
summary |
shortlog | log |
commit |
commitdiff |
tree
first ⋅ prev ⋅ next
Sergey Matveev [Sun, 20 Feb 2022 16:06:47 +0000 (19:06 +0300)]
Compress WARCs
Sergey Matveev [Sun, 20 Feb 2022 15:51:49 +0000 (18:51 +0300)]
Sectioned usage
Sergey Matveev [Sun, 20 Feb 2022 15:21:01 +0000 (18:21 +0300)]
Rebranding
Sergey Matveev [Sun, 20 Feb 2022 14:43:33 +0000 (17:43 +0300)]
Mention encs/ and warcs/
Sergey Matveev [Sun, 20 Feb 2022 14:42:18 +0000 (17:42 +0300)]
Print newly created feed
Sergey Matveev [Sun, 20 Feb 2022 14:35:12 +0000 (17:35 +0300)]
Shuffle jobs
Sergey Matveev [Sun, 20 Feb 2022 11:37:32 +0000 (14:37 +0300)]
Various refactoring and parametrizing
* cmd/env.rc has all options you can override
* HTTP/HTTP proxy for curl/wget is controlled with
http_proxy/https_proxy environment variable
* User-Agent is not hard-coded, can be even empty
* download-n-parse helper
* Example mailcap
Sergey Matveev [Sat, 19 Feb 2022 10:44:45 +0000 (13:44 +0300)]
Corrent mtime won't hurt
Sergey Matveev [Sat, 19 Feb 2022 10:22:15 +0000 (13:22 +0300)]
More robust realpath
Sergey Matveev [Sat, 19 Feb 2022 09:15:14 +0000 (12:15 +0300)]
Sortable date in filenames
Sergey Matveev [Sat, 19 Feb 2022 09:05:40 +0000 (12:05 +0300)]
encs/warcs destination is optional
Sergey Matveev [Fri, 18 Feb 2022 17:01:45 +0000 (20:01 +0300)]
Per-feed max number of entries
Sergey Matveev [Fri, 18 Feb 2022 16:35:16 +0000 (19:35 +0300)]
Latest post time should be the directory mtime
Because some feeds has older feed's datetime than its posts ones.
Sergey Matveev [Fri, 18 Feb 2022 16:34:59 +0000 (19:34 +0300)]
Unify @command usage
Sergey Matveev [Fri, 18 Feb 2022 16:34:49 +0000 (19:34 +0300)]
x-urlopen.sh
Sergey Matveev [Fri, 18 Feb 2022 13:54:27 +0000 (16:54 +0300)]
Excess assignments
Sergey Matveev [Fri, 18 Feb 2022 13:52:46 +0000 (16:52 +0300)]
Do not overwrite mutt.rc every time
Sergey Matveev [Fri, 18 Feb 2022 13:44:22 +0000 (16:44 +0300)]
There are empty authors in lame feeds
Sergey Matveev [Fri, 18 Feb 2022 12:00:33 +0000 (15:00 +0300)]
Enclosures and WARCs downloader
Sergey Matveev [Fri, 18 Feb 2022 10:18:46 +0000 (13:18 +0300)]
Single X-Categories header
Sergey Matveev [Fri, 18 Feb 2022 10:07:15 +0000 (13:07 +0300)]
Show disappeared feeds
Sergey Matveev [Fri, 18 Feb 2022 09:53:44 +0000 (12:53 +0300)]
Try to use GUID
Some fucking feeds update their content all the time feed is downloaded.
But at least they try to preserve their GUIDs among all entries.
Sergey Matveev [Fri, 18 Feb 2022 08:20:51 +0000 (11:20 +0300)]
Various refactoring
* No excess redo
* parallel is used, with failed jobs printing
* All commands can be run outside the project
* feed.zst -- it multiple times smaller
* Various small fixes
Sergey Matveev [Fri, 18 Feb 2022 07:39:28 +0000 (10:39 +0300)]
Fix title
Sergey Matveev [Fri, 18 Feb 2022 07:04:13 +0000 (10:04 +0300)]
More often stats checking
Sergey Matveev [Fri, 18 Feb 2022 07:00:38 +0000 (10:00 +0300)]
No wide size
Sergey Matveev [Fri, 18 Feb 2022 07:00:19 +0000 (10:00 +0300)]
Use title+content as more reliable item identifier
Sergey Matveev [Fri, 18 Feb 2022 06:59:51 +0000 (09:59 +0300)]
CDATA harms rendering
Sergey Matveev [Fri, 18 Feb 2022 06:59:22 +0000 (09:59 +0300)]
Be more accurate with UTC offset
Sergey Matveev [Fri, 18 Feb 2022 06:59:07 +0000 (09:59 +0300)]
Prefer published data over updated
Sergey Matveev [Thu, 17 Feb 2022 16:55:37 +0000 (19:55 +0300)]
Initial commit