How we OCR’ed 30,000 papers using Codex, open OCR models and Jobs

Hugging Face: “On the hub, we index arXiv papers any time someone mentions an arXiv abstract or PDF link in the README of a model, dataset or Space. Besides, any researcher can submit their work to Daily Papers at https://hf.co/papers/submit, up to 14 days after the publication date on arXiv.

Daily Papers view.

This enables researchers to promote their work by claiming papers using their Hugging Face account (simply clicking on your name will feature it on your account), as well as linking the corresponding Hugging Face models, datasets and Spaces, GitHub URL and project page. Moreover, people can upvote and comment on papers in a Reddit-like way. Finally, it is now also possible to tag papers with organizations, enabling one to feature all research papers on a given organization page such as NVIDIA or Google. The @HuggingPapers account on X also frequently shares the top trending research on the hub. Check out your own papers at https://huggingface.co/settings/papers!”

Posted in: AI, Intellectual Property, Internet, Knowledge Management, Marketing