API Reference¶
This section documents the public API of paperscraper.
Below you’ll find links to the documentation for each module:
paperscraper— Main package entry point.paperscraper.arxiv— ArXiv scraping & keyword searchpaperscraper.citations— Get (self-)citations & (self-)reference of papers and authorspaperscraper.get_dumps— Utilities to download bioRxiv, medRxiv & chemRxiv metadatapaperscraper.pdf— Download publications as pdfspaperscraper.pubmed— Pubmed keyword searchpaperscraper.scholar— Google Scholar endpointspaperscraper.xrxiv— Shared utilities for {bio,med,chem}Rxiv
Citation¶
If you use paperscraper, please cite a paper that motivated our development of this tool.
Top-level API¶
paperscraper
¶
Initialize the module.
dump_queries(keywords: List[List[Union[str, List[str]]]], dump_root: str) -> None
¶
Performs keyword search on all available servers and dump the results.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
keywords
|
List[List[Union[str, List[str]]]]
|
List of lists of keywords Each second-level list is considered a separate query. Within each query, each item (whether str or List[str]) are considered AND separated. If an item is again a list, strs are considered synonyms (OR separated). |
required |
dump_root
|
str
|
Path to root for dumping. |
required |