weblib.files

Miscellaneous utilities which are helpful sometime.

weblib.files.clear_directory(path)[source]

Delete recursively all directories and files in specified directory.

weblib.files.unique_file(path)[source]

Drop non-unique lines in the file. Return number of unique lines.

weblib.files.unique_host(path)[source]

Filter out urls with duplicated hostnames.