Esta es nuestra última selección de publicaciones y patentes mundiales en inglés sobre Large Language Models (LLM), entre muchas revistas científicas en línea, clasificadas y centradas en large language model, LLM, generative pre-trained transformer, pre-training, transformer architecture, gradient descent, GPT, tokenization, generative model, self-attention mechanism, masked language model y MLM.
Patentes: no reciente patentar sobre este tema concreto. Intente realizar una búsqueda manual exhaustiva en la base de datos de patentes a la que se hace referencia más arriba.
Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis
Published on 2024-05-25 by Sohee Yang, Jonghyeon Kim, Joel Jang, Seonghyeon Ye, Hyunji Lee, Minjoon Seo @MIT
Abstract: Previous works in prompt engineering for large language models have introduced different gradient-free probability-based prompt selection methods that aim to choose the optimal prompt among the candidates for a given task but have failed to provide a comprehensive and fair comparison between each other. In this paper, we propose a unified framework to interpret and evaluate the existing probability-based prompt selection methods by performing extensive experiments on 13 common and diverse NLP ta[...]
Our summary: Evaluation of probability-based prompt selection methods through unified framework, Improving prompt selection effectiveness through combinatorial variants of mutual information, Introducing Calibration by Marginalization method for unbiased prompt selection, Achieving high performance in prompt selection without calibration by maximizing mutual information.
prompt selection, probability-based, unified evaluation, analysis, NLP tasks
Publication