최신 논문 – 소형 언어 모델(SLM) 관련 특허

This week: aluminum alloy, selective laser melting, intermetallic lamellae, high-strength, Quantization, LoRA, fine-tuning, LLM, marine machine equipment, operational advice, Small Language Model, real-time data, speculative decoding, LLM inference, token sequence selection, text data statistics, mechanism synthesis, deep learning, contrastive graph learning, optimization stability, Fine-Tuning, Discrete Wavelet Transform, Low-Rank Adaptation, Automatic Speech Recognition, Selective Laser Melting, Surface Roughness, Process Parameters, Response Surface Methodology, LLM, semantic inference, spatial analysis, fine-grained POI

6월 18, 2026

인공지능(AI), 딥러닝, 엣지 AI 추론, 임베디드 시스템, 생성형 사전 학습 트랜스포머(GPT), 머신러닝, 자연어 처리(NLP), 신경망, 시스템 온 칩(SoC)

팁: 아래 선택 항목 외에도, 저희가 제공하는 두 개의 전체 데이터베이스를 검색하고 필터링할 수 있습니다.

> 무료 간행물 검색 도구 < 저자, 주제, 키워드, 날짜 또는 저널별로 검색.

> 무료 특허 검색 도구 < 유럽 특허청의 영문 특허를 참조하십시오.

Small language models — 소규모 언어 모델은 효율적인 사용을 가능하게 합니다. 자연어 처리 소비자 기기 및 임베디드 기기에서.

소규모 언어 모델은 약 70억 개 미만의 매개변수로 작동하는 트랜스포머 기반 자연어 처리 시스템을 의미합니다. 이 임계값은 형식적인 경계라기보다는 클라우드 추론 인프라 없이 소비자 하드웨어, 모바일 장치 및 임베디드 시스템에 배포할 수 있다는 실질적인 제약 조건에 의해 정의됩니다.

이 분야는 최첨단 모델의 계산 및 경제적 비용에 대한 직접적인 대응으로 등장했습니다. 수십억 개 이상의 매개변수를 가진 아키텍처는 광범위한 일반 기능을 보여주지만, 메모리 사용량, 추론 지연 시간 및 에너지 소비량으로 인해 온디바이스 배포, 개인 정보 보호에 민감한 애플리케이션, 저대역폭 또는 오프라인 운영 환경과 구조적으로 호환되지 않습니다.

핵심 연구 프로그램은 지식 증류(더 큰 교사 모델의 출력 분포를 기반으로 더 작은 학생 모델을 학습시키는 것), 구조적 및 비구조적 가지치기, INT4 및 INT8 표현으로의 적극적인 가중치 양자화, 그리고 최소한의 추가 컴퓨팅 비용으로 압축된 기본 모델을 도메인별 작업에 맞게 조정하는 LoRA 및 QLoRA와 같은 매개변수 효율적인 미세 조정 방법을 결합하여 소형 모델과 최첨단 모델 간의 성능 격차를 해소하는 데 주력하고 있습니다.

아래에 색인된 논문 및 특허는 모델 압축 기술, 양자화 알고리즘, 증류 프로토콜, 효율적인 변환기 아키텍처, 온디바이스 추론 최적화 및 도메인별 미세 조정 파이프라인을 다룹니다.

본 자료는 소규모 언어 모델(SLM)에 관한 전 세계 영어 논문 및 특허를 엄선하여 정리한 것으로, 다양한 온라인 과학 저널에 게재된 자료들을 포함합니다. 주요 주제는 소규모 언어 모델(SLM), 온디바이스 언어 모델, 에지 언어 모델, 컴팩트 트랜스포머, 70억 미만 파라미터 모델, 언어 모델 압축, 지식 증류 자연어 처리, 구조적 가지치기 언어 모델, 비구조적 가지치기 언어 모델, 가중치 양자화 언어 모델, INT4 양자화 자연어 처리, INT8 양자화 자연어 처리, 파라미터 효율적 미세 조정, LoRA 미세 조정, QLoRA 미세 조정, 어댑터 튜닝 언어 모델, 온디바이스 추론, 에지 추론 자연어 처리, 투기적 디코딩, 모델 증류 트랜스포머, GGUF 양자화 형식, 전문가 혼합 컴팩트 모델입니다.

Deformable high-strength aluminum alloy compositions and methods of making the same

Patent published on the 2026-06-04 in US under Ref US20260152827 by PURDUE RES FOUNDATION [US] (Zhang Xinghang [us], Wang Haiyan [us], Stegman Benjamin Thomas [us], Shang Anyu [us])

Abstract: [0000] An alloy comprising 92 at % aluminum, 2 at % titanium, 2 at % iron, 2 at % cobalt, and 2 at % nickel. A method of making an alloy is disclosed. The method contains the steps of providing particles of desired composition, utilizing a selective leaser melting (SLM) apparatus producing a first layer of the particles on a substrate and melting and solidifying a first group selected areas of the layer of particles, wherein the melting and the solidification results in an alloy of desired compo[...]

Our summary: The content describes a high-strength aluminum alloy with specific composition percentages. It outlines a method for creating the alloy using selective laser melting to achieve desired thickness and shape. The process involves layering particles, melting, and solidifying selected areas to form intermetallic structures.

aluminum alloy, selective laser melting, intermetallic lamellae, high-strength

Patent

Quantization-aware lora fine-tuning for llm

Patent published on the 2026-06-04 in US under Ref US20260154540 by MEDIATEK SINGAPORE PTE LTD [SG] (Lim Jia Yao Christopher [sg], Huang Ya-lin [tw], Li Huai-ting [tw], Wong Wai Mun [sg], Liang Jen-wei [tw], Lee Timothy Jun Jie [sg])

Abstract: [0000] In an aspect of the disclosure, a method of using a LoRA for inference with a FC layer of a LLM is provided. The method includes: dequantizing an INT input to an FP output; processing the FP output from the DQ and a first FP input from first weights of a down projection module of the LoRA, to output a first FP output; processing the first FP output from the first BMM and a second FP input from second weights of an up projection module of the LoRA, to output a second FP output; quantizing [...]

Our summary: The method describes using LoRA for inference in a fully connected layer of a large language model. It involves dequantizing inputs, processing them through down and up projection modules, and quantizing outputs. The final output is an INT inference result derived from the LoRA adjustments.

Quantization, LoRA, fine-tuning, LLM

Patent

Systems and methods for assisting operation and maintenance of marine machine equipment

Patent published on the 2026-06-03 in EP under Ref EP4752805 by ALFA LAVAL CORP AB [SE] (Karlsson Jimmie [se], Boman Jesper [se])

Abstract: [0001] The present invention relates to a method of operating and maintaining a piece of marine machine equipment. The piece of marine machine equipment is connected to a local processor. The method comprising the steps of obtaining a set of training data specific to the piece of marine machine equipment and training a Small Language Model (SLM) with the set of training data specific to the piece of marine machine equipment. The method further comprising the step of executing the trained SLM on [...]

Our summary: The invention describes a method for operating and maintaining marine machine equipment using a local processor. It involves training a Small Language Model (SLM) with specific training data for the equipment. The trained SLM provides offline operational advice utilizing real-time data from the equipment.

marine machine equipment, operational advice, Small Language Model, real-time data

Patent

Parameter-free method for efficient and accurate llm inference acceleration via speculative decoding

Patent published on the 2026-05-07 in WO under Ref WO2026092843 by MARZOLLO MICHELE [DE] (Marzollo Michele [de], Mueller Lorenz [de], Zhuang Jiawei [de], Roemer Niklas [de], Cavigelli Lukas [de])

Abstract: In some examples, apparatus and methods are provided for selecting a draft token sequence for verification by using a large language model, LLM. Different sources of statistics on text data (prompt, generated output, large dataset of text data) can be utilized in order to choose candidates to use for speculative decoding via look-ups.[...]

Our summary: This method accelerates LLM inference without parameters by using speculative decoding. It selects draft token sequences for verification through statistical analysis of text data. The approach utilizes various sources of statistics to optimize candidate selection for decoding.

speculative decoding, LLM inference, token sequence selection, text data statistics

Patent

Automated synthesis of planar linkage mechanisms with diverse joint types via spring-connected link models and contrastive graph learning

Published on 2026-03-28 by @OXFORD

Abstract: AbstractThe automated synthesis of planar linkage mechanisms has long been a challenge in mechanism design, requiring both geometric feasibility and motion accuracy. Recent advances in data-driven and neural network–based methods have shown promise in automating linkage synthesis, improving efficiency and scalability compared to traditional analytical or optimization-based techniques. Nevertheless, existing data-driven approaches remain limited in handling diverse joint configurations and ofte[...]

Our summary: This study presents a framework for automating the synthesis of planar linkage mechanisms using deep learning and physics-based modeling. It employs a spring-connected link model for diverse joint configurations and utilizes contrastive graph learning for efficient linkage retrieval. The method demonstrates improved accuracy and optimization stability compared to traditional approaches.

mechanism synthesis, deep learning, contrastive graph learning, optimization stability

Publication

Enhancing Whisper Fine-Tuning with Discrete Wavelet Transform-Based LoRA Initialization

Published on 2026-01-29 by Liang Lan, Molin Fang, Yuxuan Chen, Daliang Wang, Wenyong Wang @MDPI

Abstract: In low-resource automatic speech recognition (ASR) scenarios, parameter-efficient fine-tuning (PEFT) has become a crucial approach for adapting large pre-trained speech models. Although low-rank adaptation (LoRA) offers clear advantages in efficiency, stability, and deployment friendliness, its performance remains constrained because random initialization fails to capture the time&ndash;frequency structural characteristics of speech signals. To address this limitation, this work proposes[...]

Our summary: This work introduces a structured initialization mechanism combining LoRA with discrete wavelet transform for fine-tuning in low-resource ASR. The proposed DWTLoRA method enhances convergence speed, stability, and accuracy by aligning with speech signal characteristics. Experimental results show DWTLoRA outperforms standard LoRA and other PEFT methods in character error rate and training efficiency.

Fine-Tuning, Discrete Wavelet Transform, Low-Rank Adaptation, Automatic Speech Recognition

Publication

Influence and Optimization of Process Parameters on Surface Roughness of Selective Laser Melting of 316L Stainless Steel

Published on 2026-01-20 by Pin Dong, Kamonpong Jamkamon, Suppawat Chuvaree @MDPI

Abstract: To achieve better surface quality in selective laser melting (SLM), this study used 316L stainless steel powder and conducted a systematic design experiment to investigate the influence mechanism of process parameters on the surface roughness of the top and vertical surfaces. Response surface methodology (RSM) was then used for parameter optimization. The results showed that scanning speed has the greatest impact on surface roughness, followed by laser power, while scanning spacing has the least[...]

Our summary: This study investigates the impact of process parameters on the surface roughness of 316L stainless steel in selective laser melting. Scanning speed significantly affects surface quality, with optimal conditions identified for minimal roughness. The findings validate the effectiveness of the response surface methodology used for parameter optimization.

Selective Laser Melting, Surface Roughness, Process Parameters, Response Surface Methodology

Publication

A Lightweight LLM-Based Semantic–Spatial Inference Framework for Fine-Grained Urban POI Analysis

Published on 2026-01-16 by Zhuo Huang, Yixing Guo, Shuo Huang, Miaoxi Zhao @MDPI

Abstract: Unstructured POI name texts are widely used in fine-grained urban analysis, yet missing labels and semantic ambiguity often limit their value for spatial inference. This study proposes a large language model-based semantic&ndash;spatial inference framework (LLM-SSIF), a lightweight semantic&ndash;spatial pipeline that translates POI texts into interpretable, fine-grained spatial evidence through an end-to-end workflow that couples scalable label expansion with scale-controlled sp[...]

Our summary: This study introduces LLM-SSIF, a lightweight framework for translating unstructured POI texts into spatial evidence. It employs LoRA-based fine-tuning for efficient adaptation and enhances label coverage. The model demonstrates strong performance in urban analysis, revealing cultural differences between cities.

LLM, semantic inference, spatial analysis, fine-grained POI

Publication

다룬 주제: 소형 언어 모델, 자연어 처리, 트랜스포머 기반 시스템, 파라미터 효율성, 지식 증류, 모델 압축, 구조적 가지치기, 비구조적 가지치기, 가중치 양자화, INT4, INT8, 미세 조정 방법, 온디바이스 배포, 추론 지연 시간, 에너지 소비, 개인 정보 보호에 민감한 애플리케이션, 저대역폭 작업, 오프라인 운영 환경, IEEE 80211, ISO/IEC 30170, ISO/IEC 27001, ISO/IEC 25010 및 NIST SP 800-53.

사용된 용어집

Natural Language Processing (NLP): 인공지능 분야 중 컴퓨터와 인간 언어 간의 상호작용에 초점을 맞춘 분야로, 기계가 자연어 텍스트나 음성을 이해하고 해석하며 생성할 수 있도록 합니다. 언어 번역, 감정 분석, 음성 인식과 같은 작업을 포함합니다.

Small Language Models (SLM): 소형 신경망은 자연어 처리 작업을 위해 설계되었으며, 일반적으로 대형 모델에 비해 매개변수 수가 적고 계산 요구 사항이 낮지만, 제한된 범위 내에서 일관성 있는 텍스트를 생성하고 문맥을 이해할 수 있는 기능을 갖추고 있습니다.

역사적 맥락

모드 잠금(레이저)

모드 잠금은 피코초(10⁻¹²초)에서 펨토초(10⁻¹⁵초) 정도의 극히 짧은 레이저 펄스를 생성하는 기술입니다. 이 기술은 레이저 공진기의 다양한 종방향 모드들이 고정된 위상 관계를 유지하며 진동하도록 함으로써 작동합니다. 이렇게 하면 모드들이 건설적인 간섭을 일으켜 공진기 내에서 순환하는 단일하고 강렬한 초단펄스를 생성합니다.

탑다운 나노소재 합성

탑다운 합성법은 더 큰 벌크 재료에서 시작하여 이를 나노 크기로 분해하거나 패턴화하여 나노 재료를 만드는 방법입니다. 주요 기술로는 볼 밀링과 같은 기계적 방법과 포토리소그래피, 전자빔 리소그래피, 나노임프린트 리소그래피와 같은 리소그래피 방법이 있습니다. 이러한 방법들은 구조화된 표면이나 집적 회로를 만드는 데 자주 사용되지만, 표면 결함이 발생할 수 있다는 단점이 있습니다.

플라이휠 에너지 저장(FES)

플라이휠 에너지 저장(FES)은 회전자(플라이휠)를 매우 빠른 속도로 가속시켜 회전 운동 에너지 형태로 시스템에 에너지를 저장하는 방식입니다. 저장된 에너지는 회전 속도의 제곱에 비례합니다. 에너지를 추출할 때는 플라이휠의 회전 속도가 느려집니다. 저장된 에너지의 공식은 (E = frac{1}{2} I omega^2)이며, 여기서 I는 관성 모멘트이고 ω는 각속도입니다.

분자 전자공학

분자 전자공학은 개별 분자 또는 나노 크기의 분자 집합체를 기본적인 전자 부품으로 활용하는 분야입니다. 이 접근법은 기존의 실리콘 기반 기술을 훨씬 뛰어넘는 초소형 회로 구축을 목표로 합니다. 주요 구성 요소로는 분자 와이어, 스위치, 정류기 등이 있으며, 분자 궤도를 통한 전자 터널링과 같은 양자 역학적 특성을 이용하여 기능을 구현합니다.

실패의 물리학(PoF)

고장 물리학(Physics of Failure, PoF)은 재료 과학 및 물리학 지식을 활용하여 고장의 근본 원인 메커니즘을 이해하고 모델링하는 신뢰성 공학 접근 방식입니다. 과거 고장 사례의 통계 데이터에만 의존하는 대신, 고장 물리학은 열화 및 파손으로 이어지는 물리적 과정(예: 피로, 부식, 크리프)을 분석하여 고장을 예측하는 데 중점을 둡니다.

나노물질에서의 양자 크기 효과

양자 크기 효과는 물질의 크기가 나노 규모에 가까워짐에 따라 전자적 및 광학적 특성이 변화하는 현상을 설명합니다. 물질의 크기가 전자의 드 브로이 파장과 비슷해지면 양자 구속이 발생합니다. 이로 인해 전자의 에너지 준위가 양자화되어 크기에 따라 달라지는 밴드 갭, (E_g(R) approx E_{g,bulk} + frac{hbar^2pi^2}{2R^2}(frac{1}{m_e^*} + frac{1}{m_h^*}))이 나타납니다.

증기압 증강 계수

습한 공기 중 액체 표면 위의 물의 평형 증기압((p^*_{H_2O,a}))은 순수한 물 표면 위의 평형 증기압((p^*_{H_2O}))보다 약간 더 큽니다. 이러한 차이는 수증기 증강 계수 (f_w)로 정량화되며, 이는 온도와 습한 공기의 압력에 따라 달라집니다. 그 관계식은 (p^*_{H_2O,a} = f_w(T, p_{ms}) cdot p^*_{H_2O})입니다.

1965

1970

1974-11-15

1980

1964

1968

1970

1975

1980

컬러 텔레비전 응용 분야를 위한 유로퓸 도핑 이트륨 바나데이트 형광체의 실험실 분석.

컬러 텔레비전용 유로퓸 형광체

유로퓸이 도핑된 이트륨 바나데이트(YVO₄:Eu³⁺)가 선명한 적색 형광체로 작용할 수 있다는 발견은 컬러 텔레비전에 있어 획기적인 발전이었습니다. 이전에는 적색 형광체가 약해서 색상이 흐릿하게 표현되었습니다. Eu³⁺ 이온에서 방출되는 강렬하고 좁은 대역폭의 적색 발광은 밝고 생생한 색상을 구현할 수 있게 해주었고, 컬러 TV의 화질을 획기적으로 향상시키고 디스플레이 기술의 표준을 정립했습니다.

베지어 곡선

1960년대 프랑스 엔지니어 피에르 베지에가 르노를 위해 개발한 UNISURF는 최초의 진정한 3D CAD/CAM 시스템 중 하나였습니다. 이 시스템의 핵심 혁신은 오늘날 베지에 곡선과 곡면으로 알려진 개념을 사용한 것입니다. 베지에 곡선은 일련의 제어점으로 정의되는 매개변수 곡선으로, 자동차 차체의 복잡한 자유형 형상을 직관적이고 수학적으로 생성할 수 있게 해줍니다.

GPS 수신기가 위성 신호와 거리 측정값을 전파 물리학적으로 표시하는 모습.

GPS 삼각측량 원리

GPS는 삼각측량법을 이용하여 수신기의 위치를 파악합니다. 최소 세 개의 위성까지의 거리를 측정함으로써 수신기는 지구 표면상의 정확한 위치를 알 수 있습니다. 거리는 신호의 이동 시간에 빛의 속도를 곱하여 계산됩니다. 네 번째 위성은 수신기의 시계를 동기화하고 위도, 경도, 고도, 시간이라는 네 가지 미지수를 결정하는 데 필요합니다.

초전도 자기 에너지 저장(SMES)

초전도 자기 에너지 저장(SMES) 시스템은 초전도 코일에 직류 전류가 흐르면서 생성되는 자기장에 에너지를 저장합니다. 코일이 초전도 온도로 유지되는 한, 전기 저항으로 인한 에너지 손실이 거의 없기 때문에 에너지를 무기한으로 저장할 수 있습니다. 저장된 에너지는 (E = frac{1}{2} LI^2)로 나타낼 수 있습니다.

색도 측정실에서 분광광도계를 사용하여 섬유의 백색도 지수를 측정하는 실험실 기술자.

간츠-그리서 백색도 지수

간츠-그리서 백색도 지수는 특히 섬유 산업에서 널리 사용되는 선형 공식입니다. 이 지수는 CIE 삼자극 색온도 값에서 유도되며 (W_{GG} = Y - Px - Qy + C)로 정의됩니다. 여기서 P, Q, C는 광원과 관찰자에 특정한 상수입니다. D65/10° 조건에서 공식은 (W_{GG} = Y - 1868.322x - 3695.690y + 1809.441)입니다.

리튬 이온 삽입 메커니즘

리튬 이온 배터리는 층상 구조의 기판 물질에 이온이 가역적으로 삽입되는 삽입 메커니즘을 통해 작동합니다. 방전 시, 리튬 이온(Li⁺)은 음극(일반적으로 흑연)에서 탈삽입되어 비수용성 전해질을 통해 양극(일반적으로 금속 산화물)으로 이동합니다. 전자는 외부 회로를 통해 이동하면서 전류를 생성합니다.

전기차의 배터리 방전 심도(Depth of Discharge) 지표를 보여주는 배터리 관리 시스템 인터페이스.

방전 심도(DoD)

방전 심도(DoD)는 배터리 용량 중 방전된 비율을 나타냅니다. 이는 충전 상태(SoC)의 역수이며, DoD가 100%인 경우 배터리가 완전히 방전된 상태입니다. 배터리의 수명은 평균 DoD에 크게 좌우됩니다. DoD가 낮을수록(예: 용량의 80%까지만 방전) 배터리의 수명이 크게 늘어납니다.

MEMS 스케일링 법칙

MEMS 스케일링 법칙은 소자의 크기가 마이크로 스케일로 축소됨에 따라 물리적 힘과 특성이 어떻게 변화하는지를 설명합니다. 중력과 관성이 지배하는 거시 세계와는 달리, 마이크로 영역에서는 표면 장력, 점성, 정전기력과 같은 표면력이 작용합니다. 예를 들어, 중력은 부피(L³)에 비례하고, 정전기력은 면적(L²)에 비례하여 크기가 작아질수록 상대적으로 강해집니다.

(날짜를 알 수 없거나 관련이 없는 경우, 예를 들어 "유체역학"의 경우, 주목할 만한 등장 시기를 대략적으로 추정하여 제공합니다.)