Comments (2)
root@1a89b5aa5fce:/opt/hipBLASLt/build/release# ./clients/staging/hipblaslt-test
hipBLASLt version: 300
Query device success: there are 1 devices
Device ID 0 : AMD Radeon VII gfx906:sramecc+:xnack-
with 17.2 GB memory, max. SCLK 1801 MHz, max. MCLK 1000 MHz, compute capability 9.0
maxGridDimX 2147483647, sharedMemPerBlock 65.5 KB, maxThreadsPerBlock 1024, warpSize 64
info: parsing of test data may take a couple minutes before any test output appears...
[==========] Running 10091 tests from 2 test suites.
[----------] Global test environment set-up.
[----------] 10046 tests from _/matmul_test
[ RUN ] _/matmul_test.matmul/pre_checkin_matmul_bad_arg_bad_arg
[ OK ] _/matmul_test.matmul/pre_checkin_matmul_bad_arg_bad_arg (240 ms)
[ RUN ] _/matmul_test.matmul/pre_checkin_matmul_bad_arg_bad_arg_t2
[ OK ] _/matmul_test.matmul/pre_checkin_matmul_bad_arg_bad_arg_t2 (0 ms)
[ RUN ] _/matmul_test.matmul/pre_checkin_matmul_bad_arg_bad_arg_t3
[ OK ] _/matmul_test.matmul/pre_checkin_matmul_bad_arg_bad_arg_t3 (0 ms)
[ RUN ] _/matmul_test.matmul/pre_checkin_alpha_beta_zero_NaN_f16_rf16_rf16_rf16_rf32_r_NN_256_128_64_nnan_256_64_nnan_256_256_1
rocblaslt warning: No paths matched /opt/hipBLASLt/build/release/library/../Tensile/library/gfx906co. Make sure that HIPBLASLT_TENSILE_LIBPATH is set correctly.
/opt/hipBLASLt/clients/gtest/../include/unit.hpp:208: Failure
Expected equality of these values:
float(hCPU[i + j * size_t(lda) + k * strideA])
Which is: 0
float(hGPU[i + j * size_t(lda) + k * strideA])
Which is: 0.0050582886
[ FAILED ] _/matmul_test.matmul/pre_checkin_alpha_beta_zero_NaN_f16_rf16_rf16_rf16_rf32_r_NN_256_128_64_nnan_256_64_nnan_256_256_1, where GetParam() = { function: "matmul", name: "alpha_beta_zero_NaN", category: "pre_checkin", known_bug_platforms: "", alpha: -nan, beta: -nan, stride_a: 16384, stride_b: 8192, stride_c: 32768, stride_d: 32768, stride_e: 32768, user_allocated_workspace: 0, M: 256, N: 128, K: 64, lda: 256, ldb: 64, ldc: 256, ldd: 256, lde: 256, batch_count: 1, iters: 10, cold_iters: 2, algo: 0, solution_index: 0, a_type: f16_r, b_type: f16_r, c_type: f16_r, d_type: f16_r, compute_type: f32_r, scale_type: f32_r, initialization: "rand_int", gpu_arch: "", pad: 4096, grouped_gemm: 0, threads: 0, streams: 0, devices: (5836 ms)
[ RUN ] _/matmul_test.matmul/pre_checkin_alpha_beta_zero_NaN_f16_rf16_rf16_rf16_rf32_r_NN_256_128_64_nnan_256_64_2_256_256_1
/opt/hipBLASLt/clients/gtest/../include/unit.hpp:208: Failure
Expected equality of these values:
float(hCPU[i + j * size_t(lda) + k * strideA])
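@idreamerhx hipBLASLt currently supports only gfx90a devices. The device in your log is gfx906 (AMD Radeon VII), which is why no Tensile library path matched and the results differ from the CPU reference. See the hardware requirements: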
https://github.com/ROCmSoftwarePlatform/hipBLASLt/blob/develop/README.md#hardware-requirements
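For anyone who wants to catch this before running the full test suite, here is a minimal sketch of an architecture check. It assumes the target string has the form printed in the log above (`gfx906:sramecc+:xnack-`) and that the supported set is just `gfx90a`, as stated in this reply; the helper names `base_arch` and `is_supported` are hypothetical and not part of hipBLASLt.

```python
# Assumption: the supported set reflects the reply above and may change in later releases.
SUPPORTED_ARCHS = {"gfx90a"}


def base_arch(target: str) -> str:
    """Return the architecture name without feature flags such as ':sramecc+:xnack-'."""
    return target.split(":", 1)[0].strip()


def is_supported(target: str, supported=SUPPORTED_ARCHS) -> bool:
    """Check whether the base architecture of a target string is in the supported set."""
    return base_arch(target) in supported


if __name__ == "__main__":
    # Target strings as they appear in the device line of the test log.
    for target in ("gfx906:sramecc+:xnack-", "gfx90a:sramecc+:xnack-"):
        status = "supported" if is_supported(target) else "not supported"
        print(f"{base_arch(target)}: {status}")
```

On a machine with ROCm installed, the target string could be taken from the device line that `hipblaslt-test` prints at startup, as in the log above.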