Load WARCs:
@example
-$ tee fifos/add-warcs < warcs.txt
+$ tee fifos/add-warcs <warcs.txt
smth.warc-00000.warc.gz
smth.warc-00001.warc.gz
smth.warc-00002.warc.gz
smth.warc-00001.warc.gz 13
smth.warc-00002.warc.gz 0
another.warc 123
-$ echo another.warc > fifos/del-warcs
+$ echo another.warc >fifos/del-warcs
@end example
One possibility that @file{smth.warc-00002.warc.gz} has no URIs is that
@end itemize
Loading of WARC involves its whole reading and remembering where is each
-URI response is located. You can @code{echo SAVE > fifos/add-warcs} to
+URI response is located. You can @code{echo SAVE >fifos/add-warcs} to
save in-memory index to the disk as @file{....idx.gob} files. During
the next load, if those files exists, they are used as index immediately,
without expensive WARC parsing.
@example
$ cmd/warc-extract/warc-extract -for-enzstd /path/to.warc.gz |
- cmd/zstd/enzstd > /path/to.warc.zst
+ cmd/zstd/enzstd >/path/to.warc.zst
@end example
@url{https://www.gnu.org/software/wget/, GNU Wget} can be easily used to