Set Options for the ccwarcs Package

ccwarcs_options(cache = "~/.ccwarcs_cache", cdx_sleep = 0.3,
  page_size = NULL)

Arguments

cache

Location for the cached CDX index results and WARC files

cdx_sleep

Amount of time (seconds) to wait after a CDX server call before resuming

page_size

An optional parameter to set the amount of data returned in each page

Value

A list object of class ccwarcs_options

Examples

if (FALSE) { opts <- ccwarcs_options(cache = "my_project_directory") url <- "http://www.celebuzz.com/2017-01-04" crawl <- "2018-47" results <- get_cc_index(url, crawl, .options = opts) }