Joined on 2024-06-23
nwpie synced commits to refs/pull/8116/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
f09b7cb609 rm get_work_group_size() by local cache for performance (#8286)
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
d7fd29fff1 llama : add OpenELM support (#7359)
6f63d646c1 tokenize : add --show-count (token) option (#8299)
Compare 16 commits »
nwpie synced commits to refs/pull/8119/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
f09b7cb609 rm get_work_group_size() by local cache for performance (#8286)
Compare 2 commits »
nwpie synced commits to refs/pull/8187/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
d7fd29fff1 llama : add OpenELM support (#7359)
6f63d646c1 tokenize : add --show-count (token) option (#8299)
51d2ebadbb build: Export hf-to-gguf as snakecase
Compare 12 commits »
nwpie synced commits to refs/pull/8199/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
d7fd29fff1 llama : add OpenELM support (#7359)
Compare 3 commits »
nwpie synced commits to refs/pull/8208/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
b318a4fe0a Merge cee0464070163d8d21418f409083439e7f556f34 into a38b884c6c
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
d7fd29fff1 llama : add OpenELM support (#7359)
Compare 3 commits »
nwpie synced commits to refs/pull/8236/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
d7fd29fff1 llama : add OpenELM support (#7359)
Compare 3 commits »
nwpie synced commits to refs/pull/8249/head at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
a3efa29e03 fix bug of minicpm1b,minicpm2b
nwpie synced commits to refs/pull/8252/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
d7fd29fff1 llama : add OpenELM support (#7359)
6f63d646c1 tokenize : add --show-count (token) option (#8299)
51d2ebadbb build: Export hf-to-gguf as snakecase
Compare 12 commits »
nwpie synced commits to refs/pull/8256/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
baa832a94a Merge 70bd4124829eab7d43bd089009bdd3cda5c60f1f into f09b7cb609
f09b7cb609 rm get_work_group_size() by local cache for performance (#8286)
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
d7fd29fff1 llama : add OpenELM support (#7359)
6f63d646c1 tokenize : add --show-count (token) option (#8299)
Compare 13 commits »
nwpie synced commits to refs/pull/8266/head at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
87098db626 rebase work_space api
ac8a4bd9d5 move QK_WARP_SIZE to presets.hpp
d7cf5f5abb revert debug code
870b607c76 add concat support condition
0012f2c149 revert qx_k
Compare 29 commits »
nwpie synced commits to refs/pull/8268/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
f09b7cb609 rm get_work_group_size() by local cache for performance (#8286)
Compare 2 commits »
nwpie synced commits to refs/pull/8278/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
Compare 2 commits »
nwpie synced commits to refs/pull/8279/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
61ecafa390 passkey : add short intro to README.md [no-ci] (#8317)
aa5898dc53 llama : prefer n_ over num_ prefix (#8308)
6c05752c50 contributing : update guidelines (#8316)
a9554e20b6 [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
Compare 7 commits »
nwpie synced commits to refs/pull/8281/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
61ecafa390 passkey : add short intro to README.md [no-ci] (#8317)
aa5898dc53 llama : prefer n_ over num_ prefix (#8308)
6c05752c50 contributing : update guidelines (#8316)
a9554e20b6 [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
Compare 8 commits »
nwpie synced commits to refs/pull/8283/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
a9554e20b6 [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
e235b267a2 py : switch to snake_case (#8305)
f09b7cb609 rm get_work_group_size() by local cache for performance (#8286)
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
Compare 6 commits »
nwpie synced commits to refs/pull/8295/head at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
af514c8d77 sycl : refactored helper headers into multiple files
nwpie synced commits to refs/pull/8295/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
a38b884c6c cli: add EOT when user hit Ctrl+C (#8296)
Compare 2 commits »
nwpie synced commits to refs/pull/8304/head at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
dc91715b44 llama : minor indentation during tensor loading
aa5898dc53 llama : prefer n_ over num_ prefix (#8308)
6c05752c50 contributing : update guidelines (#8316)
a9554e20b6 [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
e235b267a2 py : switch to snake_case (#8305)
Compare 7 commits »
nwpie synced commits to refs/pull/8304/merge at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
dc91715b44 llama : minor indentation during tensor loading
aa5898dc53 llama : prefer n_ over num_ prefix (#8308)
6c05752c50 contributing : update guidelines (#8316)
a9554e20b6 [SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
Compare 7 commits »
nwpie synced commits to refs/pull/8305/head at nwpie/llama.cpp from mirror 2024-07-05 09:23:33 +03:00
91deef4606 py : rename requirements for convert_legacy_llama.py
902de8826b gguf-py : use snake_case in scripts entrypoint export
3e3cc7102f cont : fix link
c172b322c2 cont
Compare 4 commits »