Commit Graph

84 Commits

Author SHA1 Message Date
刘鑫 addee2c550 surport voxcpm2 cli 2026-04-01 21:15:55 +08:00
Labmem-Zhouyx 42c428164c feat: add no_rope support for residual LM and fix streaming continuation decoding
- Add `residual_lm_no_rope` config option in VoxCPMConfig and propagate to MiniCPMModel
- Add `no_rope` field to MiniCPM4Config; make RoPE embedding optional in MiniCPMModel and MiniCPMAttention
- Add `streaming_prefix_len` parameter to generation interface
- Fix non-streaming audio decode in continuation mode to trim leading prefix patches consistently
- Refactor streaming prefix context preparation: distinguish continuation vs. zero-shot via feat_mask trailing bit instead of audio_mask sum

Made-with: Cursor
2026-03-31 17:07:33 +08:00
刘鑫 d9cf376e16 update voxcpm2 2026-03-31 11:50:37 +08:00
刘鑫 23ed7ffeee fix: fix some bugs in resuming multi-GPU training 2026-03-13 18:43:07 +08:00
xliucs 7823e14b82 Merge pull request #188 from haosenwang1018/fix/bare-excepts
fix: use specific exceptions instead of bare except
2026-03-03 11:49:00 +08:00
haosenwang1018 8df79de636 fix: use specific exceptions instead of bare except
- lora_ft_webui.py: except (JSONDecodeError, OSError) for config file
- voxcpm.py: except ImportError for triton availability check
2026-02-24 22:19:45 +00:00
xliucs acaadb19e9 Merge pull request #186 from symhsym/patch-1
Update train_voxcpm_finetune.py
2026-02-11 18:05:39 +08:00
symhsym 07e526a231 Update train_voxcpm_finetune.py
修改了issue#185中提到的问题,在训练时进行validate会对原模型执行to(torch.bfloat16)然后to(torch.float32)的操作,这样可能导致模型数值浮动,因此这个修改让validate步骤保留原模型数值
2026-02-11 11:17:47 +08:00
xliucs 7aadc6c94e Merge pull request #161 from s3ldc/cli-arg-validation
Improve CLI argument validation and help text
2026-01-24 13:06:30 +08:00
Biriy 8f3a91cac8 cli: improve argument validation and help text for VoxCPM CLI 2026-01-20 14:33:58 +05:30
xliucs e72fb42c38 Merge pull request #147 from zanellig/main
Fix README's feature checkboxes
2026-01-19 12:41:22 +08:00
Gonzalo Zanelli 6dd63a534f fix: feature checkboxes 2026-01-18 18:03:24 -03:00
刘鑫 79e75f259e Fix: optimize save ckpt function 2026-01-16 16:22:34 +08:00
刘鑫 e8dd956fc2 Print all log messages to stderr instead of stdout 2026-01-12 15:30:45 +08:00
xliucs db75a7269b Merge pull request #141 from vytskalt/main
Print debug messages to stderr instead of stdout
2026-01-12 15:06:51 +08:00
vytskalt f2e203d5e2 print debug messages to stderr instead of stdout 2026-01-09 20:05:52 +02:00
xliucs 6ecc00a5d3 Merge pull request #139 from lrjerryli/main
Add torchcodec to dependencies
2026-01-04 16:08:09 +08:00
李睿杰JerryLi 8cfd9d155a Add torchcodec to dependencies
ImportError: TorchCodec is required for load_with_torchcodec.
2026-01-02 21:00:23 +08:00
刘鑫 6499215204 Optimize logging validation set results to TensorBoard 2025-12-27 11:49:04 +08:00
xliucs d57ac634f8 Merge pull request #128 from jayll1303/feat/optimize-data-loader
perf: optimize dataset length calculation via batch column access
2025-12-20 14:19:35 +08:00
jayllfpt de11c6a8cb OPTIMIZE: Improve sample length computation by using batch column access 2025-12-20 06:32:39 +07:00
刘鑫 ee5f2567ac FIX:When a prompt is present, concatenate two patches as the context for VAE decoding 2025-12-15 20:37:02 +08:00
刘鑫 b3a2d95fec FIX:When a prompt is present, concatenate two patches as the context for VAE decoding 2025-12-15 20:35:46 +08:00
刘鑫 aabda60833 add lora finetune data setting QA 2025-12-10 20:25:24 +08:00
刘鑫 a266c0a88d add lora funetine webUI; optimize lora save and load logic 2025-12-09 21:34:39 +08:00
Labmem-Zhouyx 0779a93697 Merge branch 'main' of https://github.com/OpenBMB/VoxCPM 2025-12-07 02:02:08 +08:00
Labmem-Zhouyx a1f9d0c3b6 Update: release note 2025-12-07 01:59:53 +08:00
xliucs aefba63f71 Merge pull request #98 from Ayin1412/main
修复lora/ft测试代码处传参错误的内容
2025-12-06 17:38:19 +08:00
Ayin1412 58717d7d82 修复lora/ft测试代码处传参错误的内容 2025-12-06 14:49:35 +08:00
Labmem-Zhouyx 1b0ff5693c Update: model parameters 2025-12-06 01:22:30 +08:00
Labmem-Zhouyx 762815a5b7 Update: user guides 2025-12-05 23:57:43 +08:00
Labmem-Zhouyx 5b13a35ea6 Update: gradio description 2025-12-05 23:47:35 +08:00
Labmem-Zhouyx 3ba727a615 Update: gradio description 2025-12-05 23:38:04 +08:00
Labmem-Zhouyx a7a447b02a Merge branch 'dev_1.5'
# Conflicts:
#	README.md
#	docs/finetune.md
#	scripts/test_voxcpm_ft_infer.py
#	scripts/test_voxcpm_lora_infer.py
#	src/voxcpm/core.py
1.5.0
2025-12-05 22:38:03 +08:00
刘鑫 400f47a516 Modify lora inference api 2025-12-05 22:22:13 +08:00
Labmem-Zhouyx b1f7593ae0 Update: default no denoise & normalize 2025-12-05 22:16:27 +08:00
Labmem-Zhouyx 6a5e713698 fix: streaming mode 2025-12-05 22:06:15 +08:00
Labmem-Zhouyx 3443dbb212 Update: VoxCPM1.5 and fine-tuning supprt 2025-12-05 21:04:51 +08:00
Labmem-Zhouyx 461ad7e506 Update: VoxCPM1.5 and fine-tuning supprt 2025-12-05 21:00:01 +08:00
Labmem-Zhouyx d1bb6aaf41 update technical report 1.0.5 2025-09-30 10:47:39 +08:00
刘鑫 2eb4d39719 FX: Add MPS support 2025-09-28 21:06:35 +08:00
刘鑫 fbf8984d4e Merge branch 'main' into dev 2025-09-27 16:20:47 +08:00
刘鑫 41752dc0fa FX: Raising the Python version to avoid issues with Gradio failing to start. 1.0.4 2025-09-22 21:16:23 +08:00
xliucs b0714adcaa Merge pull request #26 from AbrahamSanders/main
Add a streaming API for VoxCPM
2025-09-22 20:47:07 +08:00
AbrahamSanders 89f4d917a0 Update readme with streaming example 2025-09-19 17:09:30 -04:00
AbrahamSanders 5c5da0dbe6 Add a streaming API for VoxCPM 2025-09-19 16:56:11 -04:00
刘鑫 961569e76d merge from main 2025-09-19 22:08:56 +08:00
刘鑫 5f56d5ff5d FX: update README 2025-09-19 13:44:33 +08:00
xliucs 169c17ddfd Merge pull request #17 from MayDomine/main
add prompt-file option to set prompt text
2025-09-19 13:35:36 +08:00
MayDomine 996c69a1a8 add prompt-file option to set prompt text 2025-09-19 12:53:23 +08:00