arXiv:2503.02497v3 Announce Type: replace-cross
Abstract: Large Language Models (LLMs) offer powerful capabilities in code generation, natural language understanding, and domain-specific reasoning. However, their application to quantum software development remains limited, in part because of the lack of high-quality datasets that can serve both as LLM training material and as dependable knowledge sources. To bridge this gap, we introduce PennyLang, an off-the-shelf, high-quality dataset of 3,347 PennyLane-specific quantum code samples with contextual descriptions, curated from textbooks, official documentation, and open-source repositories. Our contributions are threefold: (1) the creation and open-source release of PennyLang, a purpose-built dataset for quantum programming with PennyLane; (2) a framework for automated quantum code dataset construction that systematizes curation, annotation, and formatting to maximize downstream LLM usability; and (3) a baseline evaluation of the dataset across multiple open-source models, including ablation studies, all conducted within a retrieval-augmented generation (RAG) pipeline. Using PennyLang with RAG substantially improves performance: for example, Qwen 7B's success rate rises from 8.7% without retrieval to 41.7% with full-context augmentation, and LLaMa 4 improves from 78.8% to 84.8%, while retrieval also reduces hallucinations and improves quantum code correctness. Moving beyond Qiskit-focused studies, we bring LLM-based tools and reproducible methods to PennyLane, advancing AI-assisted quantum development.
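
The abstract does not detail how its RAG pipeline is implemented. The sketch below illustrates, under stated assumptions, what retrieval augmentation over a dataset like PennyLang could look like: a local JSON export with hypothetical `description` and `code` fields, an off-the-shelf sentence-transformers embedder, and cosine-similarity retrieval. None of these are confirmed details of the paper's pipeline.

```python
import json
import numpy as np
from sentence_transformers import SentenceTransformer

# Hypothetical local export of the dataset; the file name and the
# "description"/"code" fields are assumptions, not the published schema.
with open("pennylang.json") as f:
    samples = json.load(f)

# Any sentence embedder works here; MiniLM is a common lightweight choice.
embedder = SentenceTransformer("all-MiniLM-L6-v2")
corpus = [s["description"] for s in samples]
corpus_emb = embedder.encode(corpus, normalize_embeddings=True)

def retrieve(query: str, k: int = 3) -> list[dict]:
    """Return the k samples whose descriptions best match the query."""
    q_emb = embedder.encode([query], normalize_embeddings=True)
    # With normalized embeddings, the dot product is cosine similarity.
    scores = (corpus_emb @ q_emb.T).ravel()
    top = np.argsort(scores)[::-1][:k]
    return [samples[i] for i in top]

def build_prompt(query: str) -> str:
    """Prepend retrieved PennyLane examples to the user's request."""
    context = "\n\n".join(
        f"# {s['description']}\n{s['code']}" for s in retrieve(query)
    )
    return f"{context}\n\n# Task: {query}\n"

# Usage: the augmented prompt is then passed to the target LLM
# (e.g., Qwen 7B) in place of the bare query.
prompt = build_prompt("Create a 2-qubit Bell state circuit and measure it")
```

Retrieving on the contextual descriptions rather than the raw code is one plausible design choice; the reported gains (e.g., 8.7% to 41.7% for Qwen 7B) come from supplying such in-context PennyLane examples at generation time.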