Add support for building CUDA extension on Windows (#396)
* Enable FP6-LLM kernel build on Windows
* fix benchmark script
* update setup.py
* update
* fix indent
* add -t=0 for linux
---------
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
torchao/csrc/cuda/fp6_llm/kernel_matmul.cuh (+5 -4)
@@ -12,7 +12,7 @@
 // See the License for the specific language governing permissions and
 // limitations under the License.
 //
-// This file is copied from https://github.com/usyd-fsalab/fp6_llm/blob/ce76774bcfc26b325c1b558abcf1935026d9abbc/fp6_llm/csrc/include/kernel_matmul.cuh
+// This file is modified from https://github.com/usyd-fsalab/fp6_llm/blob/ce76774bcfc26b325c1b558abcf1935026d9abbc/fp6_llm/csrc/include/kernel_matmul.cuh
torchao/csrc/cuda/fp6_llm/utils_parallel_dequant.cuh (+17 -13)
@@ -12,7 +12,8 @@
 // See the License for the specific language governing permissions and
 // limitations under the License.
 //
-// This file is copied from https://github.com/usyd-fsalab/fp6_llm/blob/ce76774bcfc26b325c1b558abcf1935026d9abbc/fp6_llm/csrc/include/utils_parallel_dequant.cuh
+// This file is modified from https://github.com/usyd-fsalab/fp6_llm/blob/ce76774bcfc26b325c1b558abcf1935026d9abbc/fp6_llm/csrc/include/utils_parallel_dequant.cuh
+// To support MSVC, all instances of u_int32_t are changed to uint32_t.
 
 #ifndef UTILS_PARALLELDEQUANT_CUH
 #define UTILS_PARALLELDEQUANT_CUH
@@ -26,7 +27,7 @@
  * Outputs: R1, R2
  * Note: Simplified Exponent calculation is applied.
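
For context on the MSVC note in utils_parallel_dequant.cuh: `u_int32_t` is a BSD/glibc typedef that comes from `<sys/types.h>`, which MSVC does not provide, while `uint32_t` is the standard fixed-width type from `<cstdint>`. The sketch below shows the portable spelling in isolation. It is not code from this commit; `pack_halves` and `check_pack_halves` are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <cstdint>

// uint32_t is an exact-width 32-bit type on every platform that defines it.
static_assert(sizeof(uint32_t) == 4, "uint32_t must be exactly 32 bits");

// Hypothetical helper (not from the patched file): pack two 16-bit values
// into one 32-bit word, with `hi` in the upper half and `lo` in the lower.
inline uint32_t pack_halves(uint16_t lo, uint16_t hi) {
    return (static_cast<uint32_t>(hi) << 16) | lo;
}

// Hypothetical self-check that exercises the helper once.
inline void check_pack_halves() {
    assert(pack_halves(0x1234u, 0xABCDu) == 0xABCD1234u);
}
```

Because the sketch uses only `<cstdint>` types, the same source should compile unchanged under GCC, Clang, and MSVC, which is presumably the motivation for the rename in the patch.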