Product Design, Manufacturing & Innovation Resources

Dernières publications et brevets sur les petits modèles de langage (SLM)

Modèles de langage de petite taille

Astuce : suite à la sélection ci-dessous, vous pouvez rechercher et filtrer nos 2 bases de données complètes :

> outil de recherche de publications gratuit < par auteur, sujet, mots-clés, date ou revue.

> outil gratuit de recherche de brevets < pour les brevets en anglais de l'Office européen des brevets.

Modèles de langage à petite échelle
Les petits modèles linguistiques permettent une traitement du langage naturel sur les appareils grand public et les appareils embarqués.

Les modèles de langage simplifiés désignent les systèmes de traitement du langage naturel basés sur des transformateurs fonctionnant avec moins de 7 milliards de paramètres environ — un seuil défini moins par une limite formelle que par la contrainte pratique de déploiement sur du matériel grand public, des appareils mobiles et des systèmes embarqués sans infrastructure d'inférence dans le cloud.

Ce domaine est apparu en réponse directe aux coûts de calcul et économiques des modèles à grande échelle : bien que les architectures à plus d'un milliard de paramètres démontrent une large capacité générale, leur empreinte mémoire, leur latence d'inférence et leur consommation d'énergie les rendent structurellement incompatibles avec le déploiement sur appareil, les applications sensibles à la confidentialité et les contextes opérationnels à faible bande passante ou hors ligne.

Le programme de recherche central vise à combler l'écart de capacité entre les modèles compacts et les modèles de pointe grâce à une combinaison de distillation des connaissances — l'entraînement d'un modèle étudiant plus petit par rapport aux distributions de sortie d'un modèle enseignant plus grand —, d'élagage structuré et non structuré, de quantification agressive des poids jusqu'aux représentations INT4 et INT8, et de méthodes de réglage fin efficaces en termes de paramètres telles que LoRA et QLoRA qui adaptent un modèle de base compressé à des tâches spécifiques au domaine à un coût de calcul supplémentaire minimal.

Les publications et brevets indexés ci-dessous traitent des techniques de compression de modèles, des algorithmes de quantification, des protocoles de distillation, des architectures de transformateurs efficaces, de l'optimisation de l'inférence sur l'appareil et des pipelines de réglage fin spécifiques au domaine :

Voici notre dernière sélection de publications et de brevets mondiaux en anglais sur les petits modèles de langage (SLM), parmi de nombreuses revues scientifiques en ligne, classées et axées sur petit modèle de langage, SLM, modèle de langage sur appareil, modèle de langage à la périphérie, transformateur compact, modèle à paramètres sub-7B, compression de modèle de langage, distillation de connaissances NLP, modèle de langage à élagage structuré, modèle de langage à quantification de poids, quantification NLP INT4, quantification NLP INT8, ajustement fin efficace des paramètres, ajustement fin LoRA, QLoRA, modèle linguistique d'élagage non structuré, modèle linguistique de quantification des poids, quantification INT4 NLP, quantification INT8 NLP, réglage fin efficace des paramètres, réglage fin LoRA, réglage fin QLoRA, modèle linguistique d'adaptation, inférence sur appareil, inférence NLP de bord, décodage spéculatif, transformateur de distillation de modèle, format de quantification GGUF et modèle compact de mélange d'experts.

Parameter-free method for efficient and accurate llm inference acceleration via speculative decoding

Patent published on the 2026-05-07 in WO under Ref WO2026092843 by MARZOLLO MICHELE [DE] (Marzollo Michele [de], Mueller Lorenz [de], Zhuang Jiawei [de], Roemer Niklas [de], Cavigelli Lukas [de])

Abstract: In some examples, apparatus and methods are provided for selecting a draft token sequence for verification by using a large language model, LLM. Different sources of statistics on text data (prompt, generated output, large dataset of text data) can be utilized in order to choose candidates to use for speculative decoding via look-ups.[...]


Our summary: This method accelerates LLM inference without parameters by using speculative decoding. It selects draft token sequences for verification through statistical analysis of text data. The approach utilizes various sources of statistics to optimize candidate selection for decoding.

speculative decoding, LLM inference, token sequence selection, text data statistics

Patent

Automated synthesis of planar linkage mechanisms with diverse joint types via spring-connected link models and contrastive graph learning

Published on 2026-03-28 by @OXFORD

Abstract: AbstractThe automated synthesis of planar linkage mechanisms has long been a challenge in mechanism design, requiring both geometric feasibility and motion accuracy. Recent advances in data-driven and neural network–based methods have shown promise in automating linkage synthesis, improving efficiency and scalability compared to traditional analytical or optimization-based techniques. Nevertheless, existing data-driven approaches remain limited in handling diverse joint configurations and ofte[...]


Our summary: This study presents a framework for automating the synthesis of planar linkage mechanisms using deep learning and physics-based modeling. It employs a spring-connected link model for diverse joint configurations and utilizes contrastive graph learning for efficient linkage retrieval. The method demonstrates improved accuracy and optimization stability compared to traditional approaches.

mechanism synthesis, deep learning, contrastive graph learning, optimization stability

Publication

Enhancing Whisper Fine-Tuning with Discrete Wavelet Transform-Based LoRA Initialization

Published on 2026-01-29 by Liang Lan, Molin Fang, Yuxuan Chen, Daliang Wang, Wenyong Wang @MDPI

Abstract: In low-resource automatic speech recognition (ASR) scenarios, parameter-efficient fine-tuning (PEFT) has become a crucial approach for adapting large pre-trained speech models. Although low-rank adaptation (LoRA) offers clear advantages in efficiency, stability, and deployment friendliness, its performance remains constrained because random initialization fails to capture the time&amp;ndash;frequency structural characteristics of speech signals. To address this limitation, this work proposes[...]


Our summary: This work introduces a structured initialization mechanism combining LoRA with discrete wavelet transform for fine-tuning in low-resource ASR. The proposed DWTLoRA method enhances convergence speed, stability, and accuracy by aligning with speech signal characteristics. Experimental results show DWTLoRA outperforms standard LoRA and other PEFT methods in character error rate and training efficiency.

Fine-Tuning, Discrete Wavelet Transform, Low-Rank Adaptation, Automatic Speech Recognition

Publication

A Systematic Evaluation and Adaptation of Large Language Models for Very Short-Term Power Load Forecasting

Published on 2026-01-26 by Yansheng Chen, Miao Chen, Chenchao Hu, Jinxi Wu, Ruilin Qin @MDPI

Abstract: Power load forecasting is critical for ensuring grid security and stability and optimizing energy resource allocation. The high integration of renewable energy poses significant challenges to traditional methods in data-scarce scenarios. Recently, Large Language Models (LLMs) have shown considerable potential in processing time-series data, yet their effectiveness in very short-term power load forecasting lacks systematic evaluation. This paper proposes a targeted prompt engineering framework an[...]


Our summary: This study evaluates the effectiveness of Large Language Models (LLMs) for very short-term power load forecasting. It proposes a targeted prompt engineering framework and introduces Ele-LLM, a model utilizing Low-Rank Adaptation for improved performance. Experimental results indicate Ele-LLM significantly outperforms traditional forecasting methods, demonstrating the potential of LLMs in this domain.

Large Language Models, Power Load Forecasting, Prompt Engineering, Low-Rank Adaptation

Publication

A Comparative Study and Introduction of a New Heat Source Model for the Macro-Scale Numerical Simulation of Selective Laser Melting Technology

Published on 2026-01-25 by Hao Zhang, Shuai Wang, Junjie Wang, Zhiqiang Yan @MDPI

Abstract: Selective Laser Melting (SLM), as a common metal additive manufacturing (AM) technology, achieves high-precision complex part formation by layer-by-layer melting of metal powder using a laser. However, the dynamic behavior of the melt pool during the SLM process is influenced by the heat source model, which is crucial for suppressing porosity defects and optimizing process parameters, directly determining the reliability of numerical simulations. To address the issue of traditional surface heat [...]


Our summary: This study introduces a new dynamic heat source model for Selective Laser Melting (SLM) to improve numerical simulations. It compares various heat source models and demonstrates that the dynamic model achieves greater accuracy in predicting melt pool dimensions. The results indicate significant improvements in both melt pool width and depth compared to traditional models.

Heat Source Model, Selective Laser Melting, Finite Element Analysis, Temperature Field

Publication

Influence and Optimization of Process Parameters on Surface Roughness of Selective Laser Melting of 316L Stainless Steel

Published on 2026-01-20 by Pin Dong, Kamonpong Jamkamon, Suppawat Chuvaree @MDPI

Abstract: To achieve better surface quality in selective laser melting (SLM), this study used 316L stainless steel powder and conducted a systematic design experiment to investigate the influence mechanism of process parameters on the surface roughness of the top and vertical surfaces. Response surface methodology (RSM) was then used for parameter optimization. The results showed that scanning speed has the greatest impact on surface roughness, followed by laser power, while scanning spacing has the least[...]


Our summary: This study investigates the impact of process parameters on the surface roughness of 316L stainless steel in selective laser melting. Scanning speed significantly affects surface quality, with optimal conditions identified for minimal roughness. The findings validate the effectiveness of the response surface methodology used for parameter optimization.

Selective Laser Melting, Surface Roughness, Process Parameters, Response Surface Methodology

Publication

Effect of Selectively Etched Al-Rich and Si-Rich Microstructures on the Adhesion of Polyimide Coatings to SLM AlSi10Mg

Published on 2026-01-18 by Jianzhu Li, Shuo Yang, Yujie Li @MDPI

Abstract: Interfacial adhesion between selective laser-melted (SLM) AlSi10Mg and polyimide (PI) insulating coatings is often limited by mismatched physicochemical properties. To improve adhesion, Al-rich and Si-rich microstructured surfaces were fabricated on the XY plane (perpendicular to the build direction) and the Z plane (parallel to the build direction) by acidic and alkaline etching, exploiting the characteristic microstructure of SLM AlSi10Mg. Surface topography, chemical composition, and wettabil[...]


Our summary: Selectively etched microstructures on AlSi10Mg improve adhesion to polyimide coatings. Al-rich surfaces showed cohesive failure, while Si-rich surfaces exhibited mixed failure modes. Shear and pull-off tests demonstrated significant increases in mechanical performance compared to polished surfaces.

adhesion, microstructures, polyimide, selective laser melting

Publication

Monolithic microelectromechanical systems based spatial light modulators including multiple arrays, each array configured to modulate different wavele

Patent published on the 2026-01-02 in WO under Ref WO2026006343 by SILICON LIGHT MACHINES CORP [US] (Hamann Stephen [us], Payne Alexander [us], Hunter James [us], Liu Tianbo [us], Eng Lars [us])

Abstract: A monolithic spatial light modulator (SLM) is provided. Generally, the SLM includes a substrate with a number of substrate electrodes in a surface thereof, multiple MEMS-based linear arrays formed on the surface of the substrate, and a drive circuit monolithically integrated in the substrate below the surface of the substrate. Each linear array includes multiple ribbons suspended above the surface of the substrate, each ribbon having a light reflective surface facing away from the surface of the[...]


Our summary: The content describes a monolithic spatial light modulator (SLM) that includes multiple MEMS-based linear arrays. Each array is designed to modulate different, non-overlapping wavelength ranges. The SLM integrates a drive circuit with substrate electrodes to control the electrostatically displaceable ribbons for light modulation.

MEMS, spatial light modulators, wave modulation, monolithic integration

Patent

Sujets abordés : Petits modèles linguistiques, traitement du langage naturel, systèmes basés sur des transformateurs, efficacité des paramètres, distillation des connaissances, compression des modèles, élagage structuré, élagage non structuré, quantification des poids, INT4, INT8, méthodes de réglage fin, déploiement sur l'appareil, latence d'inférence, consommation d'énergie, applications sensibles à la confidentialité, opérations à faible bande passante, contextes opérationnels hors ligne, IEEE 80211, ISO/IEC 30170, ISO/IEC 27001, ISO/IEC 25010, et NIST SP 800-53.

Glossaire des termes utilisés

Natural Language Processing (NLP): Domaine de l'intelligence artificielle axé sur l'interaction entre les ordinateurs et le langage humain, permettant aux machines de comprendre, d'interpréter et de générer du texte ou de la parole en langage naturel. Il englobe des tâches telles que la traduction, l'analyse des sentiments et la reconnaissance vocale.

Small Language Models (SLM): Les réseaux neuronaux compacts, conçus pour les tâches de traitement du langage naturel, se caractérisent généralement par un nombre réduit de paramètres et des exigences de calcul moindres par rapport aux modèles plus grands, tout en étant capables de générer un texte cohérent et de comprendre le contexte dans des limites limitées.

Contexte historique

1965
1970
1970
1974-11-15
1980
1980
1980
1964
1968
1970
1970
1975
1980
1980
1980

(si la date est inconnue ou non pertinente, par exemple « mécanique des fluides », une estimation arrondie de son émergence notable est fournie)

Les images en pleine résolution et les téléchargements sont uniquement disponibles, et 100% gratuits, pour les membres inscrits.