Arxiv API for preprint publications

import pandas as pd
import requests
import json
import xmltodict

arXiv preprint API

https://info.arxiv.org/help/api/basics.html#about

For instance, one can ask for preprints about LLM: https://export.arxiv.org/api/query?search_query=all:LLM

xml_url = 'https://export.arxiv.org/api/query?search_query=all:LLM'
req = requests.get(xml_url)
req.content
b'<?xml version="1.0" encoding="UTF-8"?>\n<feed xmlns="http://www.w3.org/2005/Atom">\n  <link href="http://arxiv.org/api/query?search_query%3Dall%3ALLM%26id_list%3D%26start%3D0%26max_results%3D10" rel="self" type="application/atom+xml"/>\n  <title type="html">ArXiv Query: search_query=all:LLM&amp;id_list=&amp;start=0&amp;max_results=10</title>\n  <id>http://arxiv.org/api/9lR339xkGcMQx0S13ig0mEetXXI</id>\n  <updated>2024-04-08T00:00:00-04:00</updated>\n  <opensearch:totalResults xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/">9527</opensearch:totalResults>\n  <opensearch:startIndex xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/">0</opensearch:startIndex>\n  <opensearch:itemsPerPage xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/">10</opensearch:itemsPerPage>\n  <entry>\n    <id>http://arxiv.org/abs/2311.10372v2</id>\n    <updated>2024-01-08T05:41:51Z</updated>\n    <published>2023-11-17T07:55:16Z</published>\n    <title>A Survey of Large Language Models for Code: Evolution, Benchmarking, and\n  Future Trends</title>\n    <summary>  General large language models (LLMs), represented by ChatGPT, have\ndemonstrated significant potential in tasks such as code generation in software\nengineering. This has led to the development of specialized LLMs for software\nengineering, known as Code LLMs. A considerable portion of Code LLMs is derived\nfrom general LLMs through model fine-tuning. As a result, Code LLMs are often\nupdated frequently and their performance can be influenced by the base LLMs.\nHowever, there is currently a lack of systematic investigation into Code LLMs\nand their performance. In this study, we conduct a comprehensive survey and\nanalysis of the types of Code LLMs and their differences in performance\ncompared to general LLMs. We aim to address three questions: (1) What LLMs are\nspecifically designed for software engineering tasks, and what is the\nrelationship between these Code LLMs? (2) Do Code LLMs really outperform\ngeneral LLMs in software engineering tasks? (3) Which LLMs are more proficient\nin different software engineering tasks? To answer these questions, we first\ncollect relevant literature and work from five major databases and open-source\ncommunities, resulting in 134 works for analysis. Next, we categorize the Code\nLLMs based on their publishers and examine their relationships with general\nLLMs and among themselves. Furthermore, we investigate the performance\ndifferences between general LLMs and Code LLMs in various software engineering\ntasks to demonstrate the impact of base models and Code LLMs. Finally, we\ncomprehensively maintained the performance of LLMs across multiple mainstream\nbenchmarks to identify the best-performing LLMs for each software engineering\ntask. Our research not only assists developers of Code LLMs in choosing base\nmodels for the development of more advanced LLMs but also provides insights for\npractitioners to better understand key improvement directions for Code LLMs.\n</summary>\n    <author>\n      <name>Zibin Zheng</name>\n    </author>\n    <author>\n      <name>Kaiwen Ning</name>\n    </author>\n    <author>\n      <name>Yanlin Wang</name>\n    </author>\n    <author>\n      <name>Jingwen Zhang</name>\n    </author>\n    <author>\n      <name>Dewu Zheng</name>\n    </author>\n    <author>\n      <name>Mingxi Ye</name>\n    </author>\n    <author>\n      <name>Jiachi Chen</name>\n    </author>\n    <link href="http://arxiv.org/abs/2311.10372v2" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2311.10372v2" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.SE" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.SE" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2308.08241v2</id>\n    <updated>2024-02-22T02:03:42Z</updated>\n    <published>2023-08-16T09:16:02Z</published>\n    <title>TEST: Text Prototype Aligned Embedding to Activate LLM\'s Ability for\n  Time Series</title>\n    <summary>  This work summarizes two ways to accomplish Time-Series (TS) tasks in today\'s\nLarge Language Model (LLM) context: LLM-for-TS (model-centric) designs and\ntrains a fundamental large model, or fine-tunes a pre-trained LLM for TS data;\nTS-for-LLM (data-centric) converts TS into a model-friendly representation to\nenable the pre-trained LLM to handle TS data. Given the lack of data, limited\nresources, semantic context requirements, and so on, this work focuses on\nTS-for-LLM, where we aim to activate LLM\'s ability for TS data by designing a\nTS embedding method suitable for LLM. The proposed method is named TEST. It\nfirst tokenizes TS, builds an encoder to embed TS via instance-wise,\nfeature-wise, and text-prototype-aligned contrast, where the TS embedding space\nis aligned to LLM embedding layer space, then creates soft prompts to make LLM\nmore open to that embeddings, and finally implements TS tasks using the frozen\nLLM. We also demonstrate the feasibility of TS-for-LLM through theory and\nexperiments. Experiments are carried out on TS classification, forecasting, and\nrepresentation tasks using eight frozen LLMs with various structures and sizes.\nThe results show that the pre-trained LLM with TEST strategy can achieve better\nor comparable performance than today\'s SOTA TS models and offer benefits for\nfew-shot and generalization. By treating LLM as the pattern machine, TEST can\nendow LLM\'s ability to process TS data without compromising language ability.\nWe hope that this study will serve as a foundation for future work to support\nTS+LLM progress.\n</summary>\n    <author>\n      <name>Chenxi Sun</name>\n    </author>\n    <author>\n      <name>Hongyan Li</name>\n    </author>\n    <author>\n      <name>Yaliang Li</name>\n    </author>\n    <author>\n      <name>Shenda Hong</name>\n    </author>\n    <link href="http://arxiv.org/abs/2308.08241v2" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2308.08241v2" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.AI" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2401.12794v1</id>\n    <updated>2024-01-23T14:29:17Z</updated>\n    <published>2024-01-23T14:29:17Z</published>\n    <title>Benchmarking LLMs via Uncertainty Quantification</title>\n    <summary>  The proliferation of open-source Large Language Models (LLMs) from various\ninstitutions has highlighted the urgent need for comprehensive evaluation\nmethods. However, current evaluation platforms, such as the widely recognized\nHuggingFace open LLM leaderboard, neglect a crucial aspect -- uncertainty,\nwhich is vital for thoroughly assessing LLMs. To bridge this gap, we introduce\na new benchmarking approach for LLMs that integrates uncertainty\nquantification. Our examination involves eight LLMs (LLM series) spanning five\nrepresentative natural language processing tasks. Additionally, we introduce an\nuncertainty-aware evaluation metric, UAcc, which takes into account both\nprediction accuracy and prediction uncertainty. Our findings reveal that: I)\nLLMs with higher accuracy may exhibit lower certainty; II) Larger-scale LLMs\nmay display greater uncertainty compared to their smaller counterparts; and\nIII) Instruction-finetuning tends to increase the uncertainty of LLMs. By\ntaking uncertainty into account, our new UAcc metric can either amplify or\ndiminish the relative improvement of one LLM over another and may even change\nthe relative ranking of two LLMs. These results underscore the significance of\nincorporating uncertainty in the evaluation of LLMs.\n</summary>\n    <author>\n      <name>Fanghua Ye</name>\n    </author>\n    <author>\n      <name>Mingming Yang</name>\n    </author>\n    <author>\n      <name>Jianhui Pang</name>\n    </author>\n    <author>\n      <name>Longyue Wang</name>\n    </author>\n    <author>\n      <name>Derek F. Wong</name>\n    </author>\n    <author>\n      <name>Emine Yilmaz</name>\n    </author>\n    <author>\n      <name>Shuming Shi</name>\n    </author>\n    <author>\n      <name>Zhaopeng Tu</name>\n    </author>\n    <arxiv:comment xmlns:arxiv="http://arxiv.org/schemas/atom">24 pages, preprints</arxiv:comment>\n    <link href="http://arxiv.org/abs/2401.12794v1" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2401.12794v1" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2402.14016v1</id>\n    <updated>2024-02-21T18:55:20Z</updated>\n    <published>2024-02-21T18:55:20Z</published>\n    <title>Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on\n  Zero-shot LLM Assessment</title>\n    <summary>  Large Language Models (LLMs) are powerful zero-shot assessors and are\nincreasingly used in real-world situations such as for written exams or\nbenchmarking systems. Despite this, no existing work has analyzed the\nvulnerability of judge-LLMs against adversaries attempting to manipulate\noutputs. This work presents the first study on the adversarial robustness of\nassessment LLMs, where we search for short universal phrases that when appended\nto texts can deceive LLMs to provide high assessment scores. Experiments on\nSummEval and TopicalChat demonstrate that both LLM-scoring and pairwise\nLLM-comparative assessment are vulnerable to simple concatenation attacks,\nwhere in particular LLM-scoring is very susceptible and can yield maximum\nassessment scores irrespective of the input text quality. Interestingly, such\nattacks are transferable and phrases learned on smaller open-source LLMs can be\napplied to larger closed-source models, such as GPT3.5. This highlights the\npervasive nature of the adversarial vulnerabilities across different judge-LLM\nsizes, families and methods. Our findings raise significant concerns on the\nreliability of LLMs-as-a-judge methods, and underscore the importance of\naddressing vulnerabilities in LLM assessment methods before deployment in\nhigh-stakes real-world scenarios.\n</summary>\n    <author>\n      <name>Vyas Raina</name>\n    </author>\n    <author>\n      <name>Adian Liusie</name>\n    </author>\n    <author>\n      <name>Mark Gales</name>\n    </author>\n    <link href="http://arxiv.org/abs/2402.14016v1" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2402.14016v1" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2402.18050v1</id>\n    <updated>2024-02-28T04:58:07Z</updated>\n    <published>2024-02-28T04:58:07Z</published>\n    <title>MEGAnno+: A Human-LLM Collaborative Annotation System</title>\n    <summary>  Large language models (LLMs) can label data faster and cheaper than humans\nfor various NLP tasks. Despite their prowess, LLMs may fall short in\nunderstanding of complex, sociocultural, or domain-specific context,\npotentially leading to incorrect annotations. Therefore, we advocate a\ncollaborative approach where humans and LLMs work together to produce reliable\nand high-quality labels. We present MEGAnno+, a human-LLM collaborative\nannotation system that offers effective LLM agent and annotation management,\nconvenient and robust LLM annotation, and exploratory verification of LLM\nlabels by humans.\n</summary>\n    <author>\n      <name>Hannah Kim</name>\n    </author>\n    <author>\n      <name>Kushan Mitra</name>\n    </author>\n    <author>\n      <name>Rafael Li Chen</name>\n    </author>\n    <author>\n      <name>Sajjadur Rahman</name>\n    </author>\n    <author>\n      <name>Dan Zhang</name>\n    </author>\n    <arxiv:comment xmlns:arxiv="http://arxiv.org/schemas/atom">EACL 2024 Demo</arxiv:comment>\n    <link href="http://arxiv.org/abs/2402.18050v1" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2402.18050v1" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.HC" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2402.08030v1</id>\n    <updated>2024-02-12T19:49:58Z</updated>\n    <published>2024-02-12T19:49:58Z</published>\n    <title>Why and When LLM-Based Assistants Can Go Wrong: Investigating the\n  Effectiveness of Prompt-Based Interactions for Software Help-Seeking</title>\n    <summary>  Large Language Model (LLM) assistants, such as ChatGPT, have emerged as\npotential alternatives to search methods for helping users navigate complex,\nfeature-rich software. LLMs use vast training data from domain-specific texts,\nsoftware manuals, and code repositories to mimic human-like interactions,\noffering tailored assistance, including step-by-step instructions. In this\nwork, we investigated LLM-generated software guidance through a within-subject\nexperiment with 16 participants and follow-up interviews. We compared a\nbaseline LLM assistant with an LLM optimized for particular software contexts,\nSoftAIBot, which also offered guidelines for constructing appropriate prompts.\nWe assessed task completion, perceived accuracy, relevance, and trust.\nSurprisingly, although SoftAIBot outperformed the baseline LLM, our results\nrevealed no significant difference in LLM usage and user perceptions with or\nwithout prompt guidelines and the integration of domain context. Most users\nstruggled to understand how the prompt\'s text related to the LLM\'s responses\nand often followed the LLM\'s suggestions verbatim, even if they were incorrect.\nThis resulted in difficulties when using the LLM\'s advice for software tasks,\nleading to low task completion rates. Our detailed analysis also revealed that\nusers remained unaware of inaccuracies in the LLM\'s responses, indicating a gap\nbetween their lack of software expertise and their ability to evaluate the\nLLM\'s assistance. With the growing push for designing domain-specific LLM\nassistants, we emphasize the importance of incorporating explainable,\ncontext-aware cues into LLMs to help users understand prompt-based\ninteractions, identify biases, and maximize the utility of LLM assistants.\n</summary>\n    <author>\n      <name>Anjali Khurana</name>\n    </author>\n    <author>\n      <name>Hari Subramonyam</name>\n    </author>\n    <author>\n      <name>Parmit K Chilana</name>\n    </author>\n    <arxiv:doi xmlns:arxiv="http://arxiv.org/schemas/atom">10.1145/3640543.3645200</arxiv:doi>\n    <link title="doi" href="http://dx.doi.org/10.1145/3640543.3645200" rel="related"/>\n    <arxiv:comment xmlns:arxiv="http://arxiv.org/schemas/atom">Accepted for publication in the Proceedings of the 29th International\n  Conference on Intelligent User Interfaces (IUI\'24), March 18--21, 2024, in\n  Greenville, SC, USA</arxiv:comment>\n    <link href="http://arxiv.org/abs/2402.08030v1" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2402.08030v1" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.HC" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.HC" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.AI" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.LG" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2307.09793v1</id>\n    <updated>2023-07-19T07:17:43Z</updated>\n    <published>2023-07-19T07:17:43Z</published>\n    <title>On the Origin of LLMs: An Evolutionary Tree and Graph for 15,821 Large\n  Language Models</title>\n    <summary>  Since late 2022, Large Language Models (LLMs) have become very prominent with\nLLMs like ChatGPT and Bard receiving millions of users. Hundreds of new LLMs\nare announced each week, many of which are deposited to Hugging Face, a\nrepository of machine learning models and datasets. To date, nearly 16,000 Text\nGeneration models have been uploaded to the site. Given the huge influx of\nLLMs, it is of interest to know which LLM backbones, settings, training\nmethods, and families are popular or trending. However, there is no\ncomprehensive index of LLMs available. We take advantage of the relatively\nsystematic nomenclature of Hugging Face LLMs to perform hierarchical clustering\nand identify communities amongst LLMs using n-grams and term frequency-inverse\ndocument frequency. Our methods successfully identify families of LLMs and\naccurately cluster LLMs into meaningful subgroups. We present a public web\napplication to navigate and explore Constellation, our atlas of 15,821 LLMs.\nConstellation rapidly generates a variety of visualizations, namely\ndendrograms, graphs, word clouds, and scatter plots. Constellation is available\nat the following link: https://constellation.sites.stanford.edu/.\n</summary>\n    <author>\n      <name>Sarah Gao</name>\n    </author>\n    <author>\n      <name>Andrew Kean Gao</name>\n    </author>\n    <arxiv:comment xmlns:arxiv="http://arxiv.org/schemas/atom">14 pages, 6 figures, 1 table</arxiv:comment>\n    <link href="http://arxiv.org/abs/2307.09793v1" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2307.09793v1" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.DL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.DL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="I.2.1; H.5.0" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2311.05656v1</id>\n    <updated>2023-11-09T00:05:27Z</updated>\n    <published>2023-11-09T00:05:27Z</published>\n    <title>Combating Misinformation in the Age of LLMs: Opportunities and\n  Challenges</title>\n    <summary>  Misinformation such as fake news and rumors is a serious threat on\ninformation ecosystems and public trust. The emergence of Large Language Models\n(LLMs) has great potential to reshape the landscape of combating\nmisinformation. Generally, LLMs can be a double-edged sword in the fight. On\nthe one hand, LLMs bring promising opportunities for combating misinformation\ndue to their profound world knowledge and strong reasoning abilities. Thus, one\nemergent question is: how to utilize LLMs to combat misinformation? On the\nother hand, the critical challenge is that LLMs can be easily leveraged to\ngenerate deceptive misinformation at scale. Then, another important question\nis: how to combat LLM-generated misinformation? In this paper, we first\nsystematically review the history of combating misinformation before the advent\nof LLMs. Then we illustrate the current efforts and present an outlook for\nthese two fundamental questions respectively. The goal of this survey paper is\nto facilitate the progress of utilizing LLMs for fighting misinformation and\ncall for interdisciplinary efforts from different stakeholders for combating\nLLM-generated misinformation.\n</summary>\n    <author>\n      <name>Canyu Chen</name>\n    </author>\n    <author>\n      <name>Kai Shu</name>\n    </author>\n    <arxiv:comment xmlns:arxiv="http://arxiv.org/schemas/atom">9 pages for the main paper, 35 pages including 656 references, more\n  resources on "LLMs Meet Misinformation" are on the website:\n  https://llm-misinformation.github.io/</arxiv:comment>\n    <link href="http://arxiv.org/abs/2311.05656v1" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2311.05656v1" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.CY" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.CY" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2311.15759v1</id>\n    <updated>2023-11-27T12:29:20Z</updated>\n    <published>2023-11-27T12:29:20Z</published>\n    <title>Towards Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage\n  and Sharing in LLMs</title>\n    <summary>  Recent advancements in multimodal large language models (MLLMs) have achieved\nsignificant multimodal generation capabilities, akin to GPT-4. These models\npredominantly map visual information into language representation space,\nleveraging the vast knowledge and powerful text generation abilities of LLMs to\nproduce multimodal instruction-following responses. We could term this method\nas LLMs for Vision because of its employing LLMs for visual-language\nunderstanding, yet observe that these MLLMs neglect the potential of harnessing\nvisual knowledge to enhance overall capabilities of LLMs, which could be\nregraded as Vision Enhancing LLMs. In this paper, we propose an approach called\nMKS2, aimed at enhancing LLMs through empowering Multimodal Knowledge Storage\nand Sharing in LLMs. Specifically, we introduce the Modular Visual Memory, a\ncomponent integrated into the internal blocks of LLMs, designed to store\nopen-world visual information efficiently. Additionally, we present a soft\nMixtures-of-Multimodal Experts architecture in LLMs to invoke multimodal\nknowledge collaboration during generation. Our comprehensive experiments\ndemonstrate that MKS2 substantially augments the reasoning capabilities of LLMs\nin contexts necessitating physical or commonsense knowledge. It also delivers\ncompetitive results on multimodal benchmarks.\n</summary>\n    <author>\n      <name>Yunxin Li</name>\n    </author>\n    <author>\n      <name>Baotian Hu</name>\n    </author>\n    <author>\n      <name>Wei Wang</name>\n    </author>\n    <author>\n      <name>Xiaochun Cao</name>\n    </author>\n    <author>\n      <name>Min Zhang</name>\n    </author>\n    <arxiv:comment xmlns:arxiv="http://arxiv.org/schemas/atom">12 pages, 4 figures</arxiv:comment>\n    <link href="http://arxiv.org/abs/2311.15759v1" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2311.15759v1" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.CL" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.AI" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.CV" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n  <entry>\n    <id>http://arxiv.org/abs/2402.10517v1</id>\n    <updated>2024-02-16T09:06:06Z</updated>\n    <published>2024-02-16T09:06:06Z</published>\n    <title>Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs</title>\n    <summary>  Recently, considerable efforts have been directed towards compressing Large\nLanguage Models (LLMs), which showcase groundbreaking capabilities across\ndiverse applications but entail significant deployment costs due to their large\nsizes. Meanwhile, much less attention has been given to mitigating the costs\nassociated with deploying multiple LLMs of varying sizes despite its practical\nsignificance. Thus, this paper introduces \\emph{any-precision LLM}, extending\nthe concept of any-precision DNN to LLMs. Addressing challenges in\nany-precision LLM, we propose a lightweight method for any-precision\nquantization of LLMs, leveraging a post-training quantization framework, and\ndevelop a specialized software engine for its efficient serving. As a result,\nour solution significantly reduces the high costs of deploying multiple,\ndifferent-sized LLMs by overlaying LLMs quantized to varying bit-widths, such\nas 3, 4, ..., $n$ bits, into a memory footprint comparable to a single $n$-bit\nLLM. All the supported LLMs with varying bit-widths demonstrate\nstate-of-the-art model quality and inference throughput, proving itself to be a\ncompelling option for deployment of multiple, different-sized LLMs. The source\ncode will be publicly available soon.\n</summary>\n    <author>\n      <name>Yeonhong Park</name>\n    </author>\n    <author>\n      <name>Jake Hyun</name>\n    </author>\n    <author>\n      <name>SangLyul Cho</name>\n    </author>\n    <author>\n      <name>Bonggeun Sim</name>\n    </author>\n    <author>\n      <name>Jae W. Lee</name>\n    </author>\n    <link href="http://arxiv.org/abs/2402.10517v1" rel="alternate" type="text/html"/>\n    <link title="pdf" href="http://arxiv.org/pdf/2402.10517v1" rel="related" type="application/pdf"/>\n    <arxiv:primary_category xmlns:arxiv="http://arxiv.org/schemas/atom" term="cs.LG" scheme="http://arxiv.org/schemas/atom"/>\n    <category term="cs.LG" scheme="http://arxiv.org/schemas/atom"/>\n  </entry>\n</feed>\n'

One option to move forward is to use the xmltodict package that transaltes an XML string into a python dictionary for futher processing.

dictionary = xmltodict.parse(req.content)
dictionary.keys()
dict_keys(['feed'])
for entry in dictionary['feed']['entry']:
    print(entry['id'], entry['title'], entry['published'])
http://arxiv.org/abs/2311.10372v2 A Survey of Large Language Models for Code: Evolution, Benchmarking, and
  Future Trends 2023-11-17T07:55:16Z
http://arxiv.org/abs/2308.08241v2 TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for
  Time Series 2023-08-16T09:16:02Z
http://arxiv.org/abs/2401.12794v1 Benchmarking LLMs via Uncertainty Quantification 2024-01-23T14:29:17Z
http://arxiv.org/abs/2402.14016v1 Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on
  Zero-shot LLM Assessment 2024-02-21T18:55:20Z
http://arxiv.org/abs/2402.18050v1 MEGAnno+: A Human-LLM Collaborative Annotation System 2024-02-28T04:58:07Z
http://arxiv.org/abs/2402.08030v1 Why and When LLM-Based Assistants Can Go Wrong: Investigating the
  Effectiveness of Prompt-Based Interactions for Software Help-Seeking 2024-02-12T19:49:58Z
http://arxiv.org/abs/2307.09793v1 On the Origin of LLMs: An Evolutionary Tree and Graph for 15,821 Large
  Language Models 2023-07-19T07:17:43Z
http://arxiv.org/abs/2311.05656v1 Combating Misinformation in the Age of LLMs: Opportunities and
  Challenges 2023-11-09T00:05:27Z
http://arxiv.org/abs/2311.15759v1 Towards Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage
  and Sharing in LLMs 2023-11-27T12:29:20Z
http://arxiv.org/abs/2402.10517v1 Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs 2024-02-16T09:06:06Z
import pandas as pd
pd.DataFrame([
    {'url': entry['id'], 'title': entry['title'], 'date': entry['published']}
    for entry in dictionary['feed']['entry']
])
url title date
0 http://arxiv.org/abs/2311.10372v2 A Survey of Large Language Models for Code: Ev... 2023-11-17T07:55:16Z
1 http://arxiv.org/abs/2308.08241v2 TEST: Text Prototype Aligned Embedding to Acti... 2023-08-16T09:16:02Z
2 http://arxiv.org/abs/2401.12794v1 Benchmarking LLMs via Uncertainty Quantification 2024-01-23T14:29:17Z
3 http://arxiv.org/abs/2402.14016v1 Is LLM-as-a-Judge Robust? Investigating Univer... 2024-02-21T18:55:20Z
4 http://arxiv.org/abs/2402.18050v1 MEGAnno+: A Human-LLM Collaborative Annotation... 2024-02-28T04:58:07Z
5 http://arxiv.org/abs/2402.08030v1 Why and When LLM-Based Assistants Can Go Wrong... 2024-02-12T19:49:58Z
6 http://arxiv.org/abs/2307.09793v1 On the Origin of LLMs: An Evolutionary Tree an... 2023-07-19T07:17:43Z
7 http://arxiv.org/abs/2311.05656v1 Combating Misinformation in the Age of LLMs: O... 2023-11-09T00:05:27Z
8 http://arxiv.org/abs/2311.15759v1 Towards Vision Enhancing LLMs: Empowering Mult... 2023-11-27T12:29:20Z
9 http://arxiv.org/abs/2402.10517v1 Any-Precision LLM: Low-Cost Deployment of Mult... 2024-02-16T09:06:06Z

Remaining tasks: construct from this dictoinary of publications a clean pandas dataframe

  • one dataframe for the articles
  • one dataframe for authors (with a column identifying the articles)