Set Options for the ccwarcs Package
ccwarcs_options(cache = "~/.ccwarcs_cache", cdx_sleep = 0.3, page_size = NULL)
cache | Location for the cached CDX index results and WARC files |
---|---|
cdx_sleep | Amount of time (seconds) to wait after a CDX server call before resuming |
page_size | An optional parameter to set the amount of data returned in each page |
A list object of class ccwarcs_options
if (FALSE) { opts <- ccwarcs_options(cache = "my_project_directory") url <- "http://www.celebuzz.com/2017-01-04" crawl <- "2018-47" results <- get_cc_index(url, crawl, .options = opts) }