Labmem-Zhouyx
0c2cf23617
Update app.py UI, adjust streaming_prefix_len, remove legacy docs
...
- Refine app.py: Ultimate Cloning naming, NFE slider, i18n polish
- Change streaming_prefix_len default from 3 to 4 for smoother decoding
- Remove legacy docs/ directory (migrated to ReadTheDocs)
Made-with: Cursor
2026-04-03 18:42:41 +08:00
Labmem-Zhouyx
b823d8107c
Merge branch 'dev_2.0' of https://github.com/OpenBMB/VoxCPM into dev_2.0
2026-04-03 17:44:46 +08:00
刘鑫
a87739426f
add voxcpm2 finetune conf
2026-04-03 14:23:15 +08:00
Labmem-Zhouyx
12c2b8ff98
update readme
2026-04-02 21:01:23 +08:00
刘鑫
30c300cfe8
adjust default cfg range
2026-04-02 18:14:35 +08:00
刘鑫
addee2c550
support voxcpm2 cli
2026-04-01 21:15:55 +08:00
Labmem-Zhouyx
42c428164c
feat: add no_rope support for residual LM and fix streaming continuation decoding
...
- Add `residual_lm_no_rope` config option in VoxCPMConfig and propagate to MiniCPMModel
- Add `no_rope` field to MiniCPM4Config; make RoPE embedding optional in MiniCPMModel and MiniCPMAttention
- Add `streaming_prefix_len` parameter to generation interface
- Fix non-streaming audio decode in continuation mode to trim leading prefix patches consistently
- Refactor streaming prefix context preparation: distinguish continuation vs. zero-shot via feat_mask trailing bit instead of audio_mask sum
Made-with: Cursor
2026-03-31 17:07:33 +08:00
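
A minimal sketch of what making RoPE optional can look like, in plain Python. The helper names (`rotate_pairs`, `maybe_apply_rope`) and the interleaved pairing convention are assumptions for illustration, not the repository's code. Only the idea of a `no_rope` switch comes from the commit message above.

```python
import math
from typing import List


def rotate_pairs(vec: List[float], position: int, base: float = 10000.0) -> List[float]:
    """Apply a rotary position embedding to one vector (interleaved pairs, an assumption)."""
    dim = len(vec)
    if dim % 2 != 0:
        raise ValueError("RoPE needs an even number of dimensions")
    out = list(vec)
    for i in range(0, dim, 2):
        # Each pair (i, i + 1) rotates by an angle that shrinks as i grows.
        theta = position * base ** (-i / dim)
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out[i] = x * c - y * s
        out[i + 1] = x * s + y * c
    return out


def maybe_apply_rope(vec: List[float], position: int, no_rope: bool = False) -> List[float]:
    """Hypothetical wrapper: skip the rotation entirely when no_rope is set."""
    if no_rope:
        return list(vec)
    return rotate_pairs(vec, position)
```

With `no_rope=True` the vector passes through unchanged, so the layer carries no explicit positional signal of its own. Otherwise each pair is rotated by a position-dependent angle, which preserves the vector norm.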
刘鑫
d9cf376e16
update voxcpm2
2026-03-31 11:50:37 +08:00
刘鑫
23ed7ffeee
fix: fix some bugs in resuming multi-GPU training
2026-03-13 18:43:07 +08:00
xliucs
7823e14b82
Merge pull request #188 from haosenwang1018/fix/bare-excepts
...
fix: use specific exceptions instead of bare except
2026-03-03 11:49:00 +08:00
haosenwang1018
8df79de636
fix: use specific exceptions instead of bare except
...
- lora_ft_webui.py: except (JSONDecodeError, OSError) for config file
- voxcpm.py: except ImportError for triton availability check
2026-02-24 22:19:45 +00:00
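
The pattern this commit describes can be sketched as follows. The function names `load_config` and `triton_available` are hypothetical stand-ins; only the exception types come from the commit body.

```python
import json


def load_config(path, default=None):
    """Read a JSON config file, falling back to a default on expected failures only."""
    try:
        with open(path, "r", encoding="utf-8") as f:
            return json.load(f)
    except (json.JSONDecodeError, OSError):
        # Malformed JSON or an unreadable/missing file; other errors still propagate.
        return {} if default is None else default


def triton_available():
    """Report whether the optional triton package can be imported."""
    try:
        import triton  # noqa: F401
        return True
    except ImportError:
        return False
```

Unlike a bare `except:`, these handlers do not swallow `KeyboardInterrupt`, `SystemExit`, or unrelated programming errors.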
xliucs
acaadb19e9
Merge pull request #186 from symhsym/patch-1
...
Update train_voxcpm_finetune.py
2026-02-11 18:05:39 +08:00
symhsym
07e526a231
Update train_voxcpm_finetune.py
...
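Fixes the problem reported in issue #185: running validation during training applied to(torch.bfloat16) and then to(torch.float32) to the original model, which could cause the model's values to drift. With this change, the validate step preserves the original model's values.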
2026-02-11 11:17:47 +08:00
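
Why a float32 → bfloat16 → float32 round trip is lossy can be shown without any framework. The sketch below emulates the cast on a single value with `struct`. It is an illustration of the number format under round-to-nearest-even, not the training script's code, and it ignores NaN and overflow handling.

```python
import struct


def as_float32(x: float) -> float:
    """Round a Python float to the nearest float32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def bf16_round_trip(x: float) -> float:
    """Emulate float32 -> bfloat16 -> float32 for one finite value."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # bfloat16 keeps only the top 16 bits of the float32 pattern.
    # Add a bias so the discarded low bits round to nearest, ties to even.
    bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (((bits + bias) >> 16) << 16) & 0xFFFFFFFF
    return struct.unpack("<f", struct.pack("<I", bits))[0]


weight = as_float32(0.1)
restored = bf16_round_trip(weight)
print(weight, restored, restored == weight)
```

Values such as 1.0 survive the round trip exactly, but most weights do not, so casting the live model back and forth leaves it slightly different from the one being trained.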
xliucs
7aadc6c94e
Merge pull request #161 from s3ldc/cli-arg-validation
...
Improve CLI argument validation and help text
2026-01-24 13:06:30 +08:00
Biriy
8f3a91cac8
cli: improve argument validation and help text for VoxCPM CLI
2026-01-20 14:33:58 +05:30
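
A minimal sketch of argument validation with helpful messages using `argparse`. The program name, flags, and defaults here are hypothetical and are not taken from the VoxCPM CLI.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(
        prog="example-tts",  # hypothetical program name
        description="Synthesize speech from text (illustrative only).",
    )
    parser.add_argument("--text", required=True,
                        help="Text to synthesize; must not be empty.")
    parser.add_argument("--cfg-value", type=float, default=2.0,
                        help="Guidance scale; must be positive (default: %(default)s).")
    return parser


def parse_args(argv=None) -> argparse.Namespace:
    parser = build_parser()
    args = parser.parse_args(argv)
    # Checks that argparse cannot express through types alone.
    if not args.text.strip():
        parser.error("--text must not be empty")
    if args.cfg_value <= 0:
        parser.error("--cfg-value must be positive")
    return args
```

`parser.error` prints the usage line plus the message to stderr and exits with status 2, so invalid input fails early with a clear explanation.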
xliucs
e72fb42c38
Merge pull request #147 from zanellig/main
...
Fix README's feature checkboxes
2026-01-19 12:41:22 +08:00
Gonzalo Zanelli
6dd63a534f
fix: feature checkboxes
2026-01-18 18:03:24 -03:00
刘鑫
79e75f259e
Fix: optimize save ckpt function
2026-01-16 16:22:34 +08:00
刘鑫
e8dd956fc2
Print all log messages to stderr instead of stdout
2026-01-12 15:30:45 +08:00
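
Routing log output to stderr keeps stdout clean for data that another program may consume. A small sketch, with `log` as a hypothetical helper name:

```python
import sys


def log(message: str, stream=None) -> None:
    """Write a diagnostic line to stderr unless another stream is given."""
    # Look up sys.stderr at call time so later redirection is respected.
    target = stream if stream is not None else sys.stderr
    print(message, file=target)


log("loading model ...")  # goes to stderr
print("result.wav")       # payload stays on stdout
```

Running such a script with `> out.txt` then captures only the payload, while the log lines still appear in the terminal.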
xliucs
db75a7269b
Merge pull request #141 from vytskalt/main
...
Print debug messages to stderr instead of stdout
2026-01-12 15:06:51 +08:00
vytskalt
f2e203d5e2
print debug messages to stderr instead of stdout
2026-01-09 20:05:52 +02:00
xliucs
6ecc00a5d3
Merge pull request #139 from lrjerryli/main
...
Add torchcodec to dependencies
2026-01-04 16:08:09 +08:00
李睿杰JerryLi
8cfd9d155a
Add torchcodec to dependencies
...
ImportError: TorchCodec is required for load_with_torchcodec.
2026-01-02 21:00:23 +08:00
刘鑫
6499215204
Optimize logging validation set results to TensorBoard
2025-12-27 11:49:04 +08:00
xliucs
d57ac634f8
Merge pull request #128 from jayll1303/feat/optimize-data-loader
...
perf: optimize dataset length calculation via batch column access
2025-12-20 14:19:35 +08:00
jayllfpt
de11c6a8cb
OPTIMIZE: Improve sample length computation by using batch column access
2025-12-20 06:32:39 +07:00
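
The idea behind batch column access can be sketched with a toy in-memory table. `ToyTable` is a hypothetical stand-in for a columnar dataset, not the data loader's actual types. It counts how many full rows get materialized to make the difference visible.

```python
from typing import Dict, List


class ToyTable:
    """Hypothetical columnar table: one list of values per column."""

    def __init__(self, columns: Dict[str, List[list]]):
        self._columns = columns
        self.row_reads = 0

    def __len__(self) -> int:
        return len(next(iter(self._columns.values())))

    def row(self, i: int) -> dict:
        # Building a row touches every column, even ones the caller ignores.
        self.row_reads += 1
        return {name: values[i] for name, values in self._columns.items()}

    def column(self, name: str) -> List[list]:
        # One lookup returns all values of a single column.
        return self._columns[name]


def lengths_row_by_row(table: ToyTable, key: str) -> List[int]:
    return [len(table.row(i)[key]) for i in range(len(table))]


def lengths_by_column(table: ToyTable, key: str) -> List[int]:
    return [len(value) for value in table.column(key)]
```

Both functions return the same lengths, but the column version never assembles a row, which is where the savings come from when rows hold large fields such as audio.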
刘鑫
ee5f2567ac
FIX: When a prompt is present, concatenate two patches as the context for VAE decoding
2025-12-15 20:37:02 +08:00
刘鑫
b3a2d95fec
FIX: When a prompt is present, concatenate two patches as the context for VAE decoding
2025-12-15 20:35:46 +08:00
刘鑫
aabda60833
add lora finetune data setting QA
2025-12-10 20:25:24 +08:00
刘鑫
a266c0a88d
add lora finetune webUI; optimize lora save and load logic
2025-12-09 21:34:39 +08:00
Labmem-Zhouyx
0779a93697
Merge branch 'main' of https://github.com/OpenBMB/VoxCPM
2025-12-07 02:02:08 +08:00
Labmem-Zhouyx
a1f9d0c3b6
Update: release note
2025-12-07 01:59:53 +08:00
xliucs
aefba63f71
Merge pull request #98 from Ayin1412/main
...
Fix incorrect argument passing in the lora/ft test code
2025-12-06 17:38:19 +08:00
Ayin1412
58717d7d82
Fix incorrect argument passing in the lora/ft test code
2025-12-06 14:49:35 +08:00
Labmem-Zhouyx
1b0ff5693c
Update: model parameters
2025-12-06 01:22:30 +08:00
Labmem-Zhouyx
762815a5b7
Update: user guides
2025-12-05 23:57:43 +08:00
Labmem-Zhouyx
5b13a35ea6
Update: gradio description
2025-12-05 23:47:35 +08:00
Labmem-Zhouyx
3ba727a615
Update: gradio description
2025-12-05 23:38:04 +08:00
Labmem-Zhouyx
a7a447b02a
Merge branch 'dev_1.5'
...
# Conflicts:
# README.md
# docs/finetune.md
# scripts/test_voxcpm_ft_infer.py
# scripts/test_voxcpm_lora_infer.py
# src/voxcpm/core.py
1.5.0
2025-12-05 22:38:03 +08:00
刘鑫
400f47a516
Modify lora inference api
2025-12-05 22:22:13 +08:00
Labmem-Zhouyx
b1f7593ae0
Update: default no denoise & normalize
2025-12-05 22:16:27 +08:00
Labmem-Zhouyx
6a5e713698
fix: streaming mode
2025-12-05 22:06:15 +08:00
Labmem-Zhouyx
3443dbb212
Update: VoxCPM1.5 and fine-tuning support
2025-12-05 21:04:51 +08:00
Labmem-Zhouyx
461ad7e506
Update: VoxCPM1.5 and fine-tuning support
2025-12-05 21:00:01 +08:00
Labmem-Zhouyx
d1bb6aaf41
update technical report
1.0.5
2025-09-30 10:47:39 +08:00
刘鑫
2eb4d39719
FIX: Add MPS support
2025-09-28 21:06:35 +08:00
刘鑫
fbf8984d4e
Merge branch 'main' into dev
2025-09-27 16:20:47 +08:00
刘鑫
41752dc0fa
FIX: Raise the Python version to avoid issues with Gradio failing to start.
1.0.4
2025-09-22 21:16:23 +08:00
xliucs
b0714adcaa
Merge pull request #26 from AbrahamSanders/main
...
Add a streaming API for VoxCPM
2025-09-22 20:47:07 +08:00
AbrahamSanders
89f4d917a0
Update readme with streaming example
2025-09-19 17:09:30 -04:00