X-Git-Url: http://www.git.stargrave.org/?a=blobdiff_plain;f=doc%2Fwarcs.texi;h=7408e9137fd85c8141717d5290c9ed454364627b;hb=ab67fcb95acbfab9778b6661ce10b2b2bb1bb0aa;hp=681354ea6b41ed11ba95efe61554a461004c0087;hpb=96c7dddd5295f66969470a8c70b9566e3e91834d;p=tofuproxy.git diff --git a/doc/warcs.texi b/doc/warcs.texi index 681354e..7408e91 100644 --- a/doc/warcs.texi +++ b/doc/warcs.texi @@ -78,17 +78,17 @@ save in-memory index to the disk as @file{....idx.gob} files. During the next load, if those files exists, they are used as index immediately, without expensive WARC parsing. -@code{redo warc-extract.cmd} utility uses exactly the same code for -parsing WARCs. It can be used to check if WARCs can be successfully +@code{cmd/warc-extract/warc-extract} utility uses exactly the same code +for parsing WARCs. It can be used to check if WARCs can be successfully loaded, to list all URIs after, to extract some specified URI and to pre-generate @file{.idx.gob} indices. @example -$ warc-extract.cmd -idx \ +$ cmd/warc-extract/warc-extract -idx \ smth.warc-00000.warc.gz \ smth.warc-00001.warc.gz \ smth.warc-00002.warc.gz -$ warc-extract.cmd -uri http://some/uri \ +$ cmd/warc-extract/warc-extract -uri http://some/uri \ smth.warc-00000.warc.gz \ smth.warc-00001.warc.gz \ smth.warc-00002.warc.gz @@ -99,8 +99,7 @@ from any kind of already existing WARCs. It has better compression ratio and much higher decompression speed, than @file{.warc.gz}. @example -$ redo cmd/zstd/enzstd -$ ./warc-extract.cmd -for-enzstd /path/to.warc.gz | +$ cmd/warc-extract/warc-extract -for-enzstd /path/to.warc.gz | cmd/zstd/enzstd > /path/to.warc.zst @end example