【vLLM 学习】Structured Outputs-程序员充电站

vLLM 是一款专为大语言模型推理加速而设计的框架，实现了 KV 缓存内存几乎零浪费，解决了内存管理瓶颈问题。

更多 vLLM 中文文档及教程可访问 →https://go.hyper.ai/Wa62f

*在线运行 vLLM 入门教程：零基础分步指南

源码 examples/offline_inference/structured_outputs.py

from enum import Enum from pydantic import BaseModel from vllm import LLM, SamplingParams from vllm.sampling_params import GuidedDecodingParams llm = LLM(model="Qwen/Qwen2.5-3B-Instruct", max_model_len=100) # 使用候选选项列表的引导式解码 guided_decoding_params = GuidedDecodingParams(choice=["Positive", "Negative"]) sampling_params = SamplingParams(guided_decoding=guided_decoding_params) outputs = llm.generate( prompts="Classify this sentiment: vLLM is wonderful!", sampling_params=sampling_params, ) print(outputs[0].outputs[0].text) # 使用 Regex 的引导式解码 guided_decoding_params = GuidedDecodingParams(regex="\w+@\w+\.com\n") sampling_params = SamplingParams(guided_decoding=guided_decoding_params, stop=["\n"]) prompt = ("Generate an email address for Alan Turing, who works in Enigma." "End in .com and new line. Example result:" "alan.turing@enigma.com\n") outputs = llm.generate(prompts=prompt, sampling_params=sampling_params) print(outputs[0].outputs[0].text) # 使用 Pydantic 模式的 JSON 引导式解码 class CarType(str, Enum): sedan = "sedan" suv = "SUV" truck = "Truck" coupe = "Coupe" class CarDescription(BaseModel): brand: str model: str car_type: CarType json_schema = CarDescription.model_json_schema() guided_decoding_params = GuidedDecodingParams(json=json_schema) sampling_params = SamplingParams(guided_decoding=guided_decoding_params) prompt = ("Generate a JSON with the brand, model and car_type of" "the most iconic car from the 90's") outputs = llm.generate( prompts=prompt, sampling_params=sampling_params, ) print(outputs[0].outputs[0].text) # 使用 Grammar 的引导式解码 simplified_sql_grammar = """ ?start: select_statement ?select_statement: "SELECT " column_list " FROM " table_name ?column_list: column_name ("," column_name)* ?table_name: identifier ?column_name: identifier ?identifier: /[a-zA-Z_][a-zA-Z0-9_]*/ """ guided_decoding_params = GuidedDecodingParams(grammar=simplified_sql_grammar) sampling_params = SamplingParams(guided_decoding=guided_decoding_params) prompt = ("Generate an SQL query to show the 'username' and 'email'" "from the 'users' table.") outputs = llm.generate( prompts=prompt, sampling_params=sampling_params, ) print(outputs[0].outputs[0].text)

Pentaho Data Integration深度指南：从数据管道构建到企业级应用

Pentaho Data Integration深度指南：从数据管道构建到企业级应用【免费下载链接】pentaho-kettle pentaho/pentaho-kettle: 一个基于 Java 的数据集成和变换工具，用于实现数据仓库和数据湖的构建。适合用于大数据集成和变换场景，可以实现高效…

李华

好写作AI：当AI听懂你的“话里有话”，创作新时代真的来了

程序员兄弟，你是不是常遇到这种“史诗级需求”？ 产品经理说：“这个页面要做得高级一点，但别太复杂。” 你内心OS：“到底是要五彩斑斓的黑，还是要大道至简的白？”更扎心的是——你发现自己在写作…

李华

高效掌握PNG元数据编辑：TweakPNG工具全攻略

高效掌握PNG元数据编辑：TweakPNG工具全攻略【免费下载链接】tweakpng A low-level PNG image file manipulation utility for Windows 项目地址: https://gitcode.com/gh_mirrors/tw/tweakpng 在数字图像处理领域，PNG文件优化与元数据编辑工具的…

李华

拖延症福音！专科生专属降AI神器 —— 千笔·专业降AI率智能体

在AI技术迅速渗透学术写作领域的当下，越来越多的专科生开始借助AI工具提升论文写作效率。然而，随之而来的“AI率超标”问题却让不少学生陷入困境——随着各大查重系统不断升级检测算法，AI生成内容被越来越严格地识别和限制。一旦AI率过高&…

李华

超实用Windows 11歌词工具：让任务栏秒变音乐歌词秀场

超实用Windows 11歌词工具：让任务栏秒变音乐歌词秀场【免费下载链接】Taskbar-Lyrics BetterNCM插件，在任务栏上嵌入歌词，目前仅建议Windows 11 项目地址: https://gitcode.com/gh_mirrors/ta/Taskbar-Lyrics 想在工作或学习时随时看…

李华

价值两万美元的复制粘贴失误：当HackerOne“黑”了自己

“价值两万美元的复制粘贴失误：当HackerOne‘黑’了自己” 让我讲述一个我所研究过的最具讽刺意味的安全事件——全球领先的漏洞赏金平台HackerOne，因一个简单的复制粘贴错误，意外地将自己王国的钥匙拱手让人。当时我正在查阅已公开的报告&a…

李华