GPT-Neo

Natural Language Processing · Introduced 2020 · 38 papers

Description

An implementation of model- and data-parallel GPT-3-like models using the mesh-tensorflow library.

Source: EleutherAI/GPT-Neo
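The released checkpoints are also available through the Hugging Face transformers library, which reimplements the architecture as `GPTNeoForCausalLM`. A distinctive feature of GPT-Neo relative to GPT-3 is its alternation of global and local (windowed) attention layers. The sketch below builds a deliberately tiny, randomly initialized model to show the configuration shape; the layer sizes here are illustrative only, not those of any released checkpoint.

```python
import torch
from transformers import GPTNeoConfig, GPTNeoForCausalLM

# Tiny illustrative configuration (hypothetical sizes, far smaller than
# the released 125M/1.3B/2.7B checkpoints).
config = GPTNeoConfig(
    vocab_size=1000,
    hidden_size=64,
    num_layers=2,
    num_heads=4,
    # GPT-Neo alternates global and local (sliding-window) attention;
    # this pattern repeated once yields the 2 layers configured above.
    attention_types=[[["global", "local"], 1]],
    max_position_embeddings=128,
)
model = GPTNeoForCausalLM(config)

# Forward pass on a random token batch; logits have shape
# (batch, sequence_length, vocab_size).
input_ids = torch.randint(0, 1000, (1, 16))
with torch.no_grad():
    out = model(input_ids)
print(tuple(out.logits.shape))  # (1, 16, 1000)
```

A pretrained checkpoint can be loaded the same way, e.g. `GPTNeoForCausalLM.from_pretrained("EleutherAI/gpt-neo-125M")` (downloads the weights from the Hugging Face Hub).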

Papers Using This Method

- IRepair: An Intent-Aware Approach to Repair Data-Driven Errors in Large Language Models (2025-02-10)
- Robust Hybrid Classical-Quantum Transfer Learning Model for Text Classification Using GPT-Neo 125M with LoRA & SMOTE Enhancement (2025-01-12)
- LLM Vocabulary Compression for Low-Compute Environments (2024-11-10)
- BERTtime Stories: Investigating the Role of Synthetic Story Data in Language Pre-training (2024-10-20)
- Reconstruction of Differentially Private Text Sanitization via Large Language Models (2024-10-16)
- The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization (2024-08-29)
- WPN: An Unlearning Method Based on N-pair Contrastive Learning in Language Models (2024-08-18)
- Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs (2024-08-13)
- Semantic Membership Inference Attack against Large Language Models (2024-06-14)
- Investigating Wit, Creativity, and Detectability of Large Language Models in Domain-Specific Writing Style Adaptation of Reddit's Showerthoughts (2024-05-02)
- More than Correlation: Do Large Language Models Learn Causal Representations of Space? (2023-12-26)
- Fairness-Aware Structured Pruning in Transformers (2023-12-24)
- Scalable Extraction of Training Data from (Production) Language Models (2023-11-28)
- Heaps' Law in GPT-Neo Large Language Model Emulated Corpora (2023-11-10)
- Watermarking LLMs with Weight Quantization (2023-10-17)
- TART: A plug-and-play Transformer module for task-agnostic reasoning (2023-09-21)
- Fine-Tuning Large Language Models for Answering Programming Questions with Code Snippets (2023-06-26)
- Exposing Bias in Online Communities through Large-Scale Language Models (2023-06-04)
- Test-Time Training on Nearest Neighbors for Large Language Models (2023-05-29)
- Controlling the Extraction of Memorized Data from Large Language Models via Prompt-Tuning (2023-05-19)