API Reference
This section documents the public API of paperscraper.
Below you’ll find links to the documentation for each module:
- paperscraper: Main package entry point.
- paperscraper.arxiv: ArXiv scraping & keyword search.
- paperscraper.citations: Get (self-)citations & (self-)references of papers and authors.
- paperscraper.get_dumps: Utilities to download bioRxiv, medRxiv & chemRxiv metadata.
- paperscraper.pdf: Download publications as PDFs.
- paperscraper.pubmed: PubMed keyword search.
- paperscraper.scholar: Google Scholar endpoints.
- paperscraper.xrxiv: Shared utilities for {bio,med,chem}Rxiv.
Citation
If you use paperscraper, please cite a paper that motivated our development of this tool.
Top-level API
paperscraper
Initialize the module.
dump_queries(keywords: List[List[Union[str, List[str]]]], dump_root: str) -> None
Performs a keyword search on all available servers and dumps the results.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
keywords | List[List[Union[str, List[str]]]] | List of lists of keywords. Each second-level list is treated as a separate query. Within a query, the items (whether str or List[str]) are combined with AND. If an item is itself a list, its strings are treated as synonyms (combined with OR). | required |
dump_root | str | Path to the root directory for dumping. | required |
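The nested shape of the keywords argument is easier to see with a concrete value. The sketch below builds two queries and renders each one as a boolean expression, following the AND/OR rules in the parameter description. The helper describe_query, the search terms, and the "dumps" directory are hypothetical and used only for illustration; none of them are part of paperscraper. The dump_queries call itself is left as a comment because it needs the package installed.

```python
from typing import List, Union

# One query: top-level items are AND-combined; a nested list holds synonyms.
Query = List[Union[str, List[str]]]


def describe_query(query: Query) -> str:
    """Render one query as a boolean expression.

    Illustrative helper only; it is not part of paperscraper.
    """
    terms = []
    for item in query:
        if isinstance(item, str):
            terms.append(item)
        else:
            # A nested list holds synonyms: any one of them may match (OR).
            terms.append("(" + " OR ".join(item) + ")")
    # All top-level items of a query must match (AND).
    return " AND ".join(terms)


# Two separate queries; the search terms are made up for this example.
keywords: List[Query] = [
    ["COVID-19", ["machine learning", "deep learning"]],
    ["CRISPR", "off-target"],
]

descriptions = [describe_query(q) for q in keywords]
for description in descriptions:
    print(description)

# With paperscraper installed, the same structure would be passed as:
#   from paperscraper import dump_queries
#   dump_queries(keywords, "dumps")  # "dumps" is a hypothetical output root
```

Each second-level list stays an independent query, so adding a third search means appending another list to keywords rather than extending an existing one.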